CELL SIGNALING PROTEINS

TECHNICAL FIELD 
This invention relates to nucleic acid and amino acid sequences of cell signaling proteins
and to the use of these sequences in the diagnosis, treatment, and prevention of cell proliferative
and inflammatory disorders.

BACKGROUND OF THE INVENTION 

Signal transduction is the process of biochemical events by which cells respond to
extracellular signals. Extracellular signals are transduced through a biochemical cascade that
begins with the binding of a signal molecule, such as a hormone, neurotransmitter, or growth
factor, to a cell membrane receptor and ends with the activation of an intracellular target molecule.
The process of signal transduction regulates a wide variety of cell functions including cell
proliferation, differentiation, and gene transcription.

Signal transduction is the general process by which cells respond to extracellular signals
(hormones, neurotransmitters, growth and differentiation factors, etc.) through a cascade of
biochemical reactions that begins with the binding of the signaling molecule to a cell membrane
receptor and ends with the activation of an intracellular target molecule. Intermediate steps in this
process involve the activation of various cytoplasmic proteins by phosphorylation via protein
kinases and the eventual translocation of some of these activated proteins to the cell nucleus, where
the transcription of specific genes is triggered. Thus, the signal transduction process regulates all
types of cell functions including cell proliferation, differentiation, and gene transcription.

Protein kinases play a key role in the signal transduction process by phosphorylating and
activating various proteins involved in signaling pathways. The high energy phosphate which
drives this activation is generally transferred from adenosine triphosphate molecules (ATP) to a
particular protein by protein kinases and removed from that protein by protein phosphatases.
Phosphorylation occurs in response to extracellular signals, cell cycle checkpoints, and
environmental or nutritional stresses. Protein kinases are roughly divided into two groups: those
that phosphorylate tyrosine residues (protein tyrosine kinases, PTK) and those that phosphorylate
serine or threonine residues (serine/threonine kinases, STK). A few protein kinases have dual
specificity for serine/threonine and tyrosine residues. Almost all kinases contain a similar 250-300
amino acid catalytic domain containing specific residues and sequence motifs characteristic of the
kinase family. (Hardie, G. and Hanks, S. (1995) The Protein Kinase FactsBook, Vol. 1:7-20,
Academic Press, San Diego, CA.)

STKs include the second messenger dependent protein kinases such as the cyclic-AMP
dependent protein kinases (PKA), which are involved in mediating hormone-induced cellular
responses; calcium-calmodulin (CaM) dependent protein kinases, which are involved in regulation
of smooth muscle contraction, glycogen breakdown, and neurotransmission; and the
mitogen-activated protein kinases (MAP kinases), which mediate signal transduction from the cell
surface to the nucleus via phosphorylation cascades. Altered PKA expression is implicated in a variety of
disorders and diseases including cancer, thyroid disorders, diabetes, atherosclerosis, and
cardiovascular disease. (Isselbacher, K.J. et al. (1994) Harrison's Principles of Internal Medicine,
McGraw-Hill, New York, NY, pp. 416-431, 1887.)

PTKs are divided into transmembrane, receptor PTKs and nontransmembrane, non-receptor
PTKs. Transmembrane protein-tyrosine kinases are receptors for most growth factors,
which include epidermal GF, platelet-derived GF, fibroblast GF, hepatocyte GF, insulin and
insulin-like GFs, nerve GF, vascular endothelial GF, and macrophage colony stimulating factor.
Non-receptor PTKs lack transmembrane regions and, instead, form complexes with the
intracellular regions of cell surface receptors. Receptors that function through non-receptor PTKs
include those for cytokines, hormones (growth hormone and prolactin), and antigen-specific
receptors on T and B lymphocytes.

Many of these PTKs were first identified as the products of mutant oncogenes in cancer
cells where their activation was no longer subject to normal cellular controls. In fact, about one
third of the known oncogenes encode PTKs, and it is well known that cellular transformation
(oncogenesis) is often accompanied by increased tyrosine phosphorylation activity. (Charbonneau,
H. and Tonks, N.K. (1992) Annu. Rev. Cell Biol. 8:463-493.)

Protein phosphatases regulate the effects of protein kinases by removing phosphate groups
from molecules previously activated by kinases. The two principal categories of protein
phosphatases are the protein phosphatases (PPs) and the protein tyrosine phosphatases (PTPs).
PPs dephosphorylate phosphoserine/threonine residues and are important regulators of many
cAMP-mediated hormone responses in cells. (Cohen, P. (1989) Annu. Rev. Biochem. 58:453-508.)
PTPs reverse the effects of protein tyrosine kinases and play a significant role in cell cycle
and cell signaling processes. (Charbonneau and Tonks, supra.) In the process of cell division, for
example, a specific PTP (M-phase inducer phosphatase) plays a key role in the induction of
mitosis by dephosphorylating and activating a specific PTK (CDC2), leading to cell division.
(Sadhu, K. et al. (1990) Proc. Natl. Acad. Sci. 87:5139-5143.)

Guanine nucleotide binding proteins (GTP-binding proteins) are critical mediators of the
signal transduction pathway. Extracellular ligands such as hormones, growth factors,
neuromodulators, or other signaling molecules bind to transmembrane receptors, and the signal is
propagated to effector molecules by intracellular signal transducing proteins. Many of these signal
transduction proteins are GTP-binding proteins which regulate intracellular signaling pathways.
GTP-binding proteins participate in a wide range of other regulatory functions including
metabolism, growth, differentiation, cytoskeletal organization, and intracellular vesicle transport
and secretion. Exchange of bound GDP for GTP followed by hydrolysis of GTP to GDP provides
the energy that enables GTP-binding proteins to alter their conformation and interact with other
cellular components. Two structurally distinct classes of GTP-binding proteins are recognized:
heterotrimeric GTP-binding proteins, consisting of three different subunits, and monomeric, low
molecular weight (LMW), GTP-binding proteins consisting of a single polypeptide chain.

G protein coupled receptors (GPCR) are a superfamily of integral membrane proteins
which transduce extracellular signals. GPCRs include receptors for biogenic amines, mediators of
inflammation, peptide hormones, and sensory signal mediators. A GPCR becomes activated when
the receptor binds to its extracellular ligand. The beta subunit of the GPCR, which consists of an
amino-terminal helical segment followed by seven WD, or β-transducin, repeats, transduces signals
across the plasma membrane. Conformational changes in the GPCR, resulting from the
ligand-receptor interaction, promote the binding of GTP to the GPCR intracellular domains. GTP binding
to the GPCR leads to the interaction of the GPCR alpha subunit with adenylate cyclase or other
second messenger molecule generators. This interaction regulates the activity of second
messenger molecules such as cAMP, cGMP, or eicosanoids which, in turn, regulate
phosphorylation and activation of other intracellular proteins. The GPCR changes conformation
upon hydrolysis of the bound GTP by GTPases, dissociates from the second messenger molecule
generator, and returns to its initial pre-ligand binding conformation.

G beta proteins, also known as β-transducins, contain seven tandem repeats of the
WD-repeat sequence motif, a motif found in many proteins with regulatory functions. WD-repeat
proteins contain from four to eight copies of a loosely conserved repeat of approximately 40
amino acids which participates in protein-protein interactions. Mutations and variant expression
of β-transducin proteins are linked with various disorders. Mutations in LIS1, a subunit of the
human platelet activating factor acetylhydrolase, cause Miller-Dieker lissencephaly; RACK1
binds activated protein kinase C; and RbAp48 binds retinoblastoma protein. CstF is required for
polyadenylation of mammalian pre-mRNA in vitro and associates with subunits of
cleavage-stimulating factor. CD4, an integral membrane glycoprotein which functions as an HIV
co-receptor for infection of human host cells, is degraded by HIV-encoded Vpu in the endoplasmic
reticulum. WD repeats of the human beta TrCP molecule mediate the formation of the CD4-Vpu complex,
inducing CD4 proteolysis (Neer, E.J. et al. (1994) Nature 371:297-300 and Margottin, F. et al.
(1998) Mol. Cell 1:565-574).

Irregularities in the GPCR signaling cascade may result in abnormal activation of
leukocytes and lymphocytes, leading to the tissue damage and destruction seen in many
inflammatory and autoimmune diseases such as rheumatoid arthritis, biliary cirrhosis, hemolytic
anemia, lupus erythematosus, and thyroiditis. Abnormal cell proliferation, including cyclic AMP
stimulation of brain, thyroid, adrenal, and gonadal tissue proliferation, is regulated by G proteins.
Mutations in subunits have been found in growth-hormone-secreting pituitary somatotroph
tumors, hyperfunctioning thyroid adenomas, and ovarian and adrenal neoplasms (Meij, J.T.A.
(1996) Mol. Cell. Biochem. 157:31-38; Aussel, C. et al. (1988) J. Immunol. 140:215-220).

LMW GTP-binding proteins regulate cell growth, cell cycle control, protein secretion, and
intracellular vesicle interaction. They consist of single polypeptides which, like the alpha subunit
of the heterotrimeric GTP-binding proteins, are able to bind to and hydrolyze GTP, thus cycling
between an inactive and an active state. LMW GTP-binding proteins respond to extracellular
signals from receptors and activating proteins by transducing mitogenic signals involved in
various cell functions. The binding and hydrolysis of GTP regulates the response of LMW
GTP-binding proteins and acts as an energy source during this process (Bokoch, G. M. and Der, C. J.
(1993) FASEB J. 7:750-759).

At least sixty members of the LMW GTP-binding protein superfamily have been
identified and are currently grouped into the subfamilies ras, rho, arf, sar1, ran, and rab.
Activated ras genes were initially found in human cancers, and subsequent studies confirmed that
ras function is critical in determining whether cells continue to grow or become differentiated.
Other members of the LMW G-protein superfamily have roles in signal transduction that vary with
the function of the activated genes and the locations of the GTP-binding proteins that initiate the
activity. Rho GTP-binding proteins control signal transduction pathways that link growth factor
receptors to actin polymerization, which is necessary for normal cellular growth and division. The
rab, arf, and sar1 families of proteins control the translocation of vesicles to and from membranes
for protein localization, protein processing, and secretion. Ran GTP-binding proteins are located
in the nucleus of cells and have a key role in nuclear protein import, the control of DNA synthesis,
and cell-cycle progression (Hall, A. (1990) Science 249:635-640; Barbacid, M. (1987) Annu. Rev.
Biochem. 56:779-827; and Sasaki, T. and Takai, Y. (1998) Biochem. Biophys. Res. Commun.
245:641-645).

LMW GTP-binding proteins are GTPases which cycle between a GTP-bound active form
and a GDP-bound inactive form. This cycle is regulated by proteins that affect GDP dissociation,
GTP association, or the rate of GTP hydrolysis. Proteins affecting GDP association are
represented by guanine nucleotide dissociation inhibitors and guanine nucleotide exchange factors
(GEP). The best characterized is the mammalian homologue of the Drosophila Son-of-Sevenless
protein. Proteins affecting GTP hydrolysis are exemplified by GTPase-activating proteins (GAP).
Both GEP and GAP activity may be controlled in response to extracellular stimuli and modulated
by accessory proteins such as RalBP1 and POB1. The GDP-bound form is converted to the
GTP-bound form through a GDP/GTP exchange reaction facilitated by guanine
nucleotide-releasing factors. The GTP-bound form is converted to the GDP-bound form by
intrinsic GTPase activity, and the conversion is accelerated by GAP (Ikeda, M. et al. (1998) J.
Biol. Chem. 273:814-821; Quilliam, L. A. (1995) Bioessays 17:395-404.) Mutant Ras-family
proteins, which bind but cannot hydrolyze GTP, are permanently activated and cause cell
proliferation or cancer, as do GEP that activate LMW GTP-binding proteins (Drivas, G. T. et al.
(1990) Mol. Cell. Biol. 10:1793-1798; and Whitehead, I. P. et al. (1998) Mol. Cell. Biol.
18:4689-4697.)
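
The GDP/GTP cycle described above can be pictured as a two-state switch. The following Python sketch is illustrative only and is not part of the disclosed methods; the names `NucleotideState`, `exchange`, and `hydrolyze`, and the `can_hydrolyze` flag, are hypothetical and are introduced solely to show how a hydrolysis-deficient mutant remains in the active state.

```python
from enum import Enum


class NucleotideState(Enum):
    """Two forms of an LMW GTP-binding protein, as described in the text."""
    GDP_BOUND = "inactive"
    GTP_BOUND = "active"


def exchange(state: NucleotideState) -> NucleotideState:
    """GDP/GTP exchange reaction: converts the GDP-bound form to the GTP-bound form."""
    if state is NucleotideState.GDP_BOUND:
        return NucleotideState.GTP_BOUND
    return state


def hydrolyze(state: NucleotideState, can_hydrolyze: bool = True) -> NucleotideState:
    """GTP hydrolysis: converts the GTP-bound form back to the GDP-bound form.

    can_hydrolyze=False is a hypothetical flag standing in for a mutant that
    binds but cannot hydrolyze GTP; such a protein stays GTP-bound.
    """
    if state is NucleotideState.GTP_BOUND and can_hydrolyze:
        return NucleotideState.GDP_BOUND
    return state
```

In this sketch a wild-type protein returns to the inactive form after `hydrolyze`, whereas calling `hydrolyze(state, can_hydrolyze=False)` on an active protein leaves it active, mirroring the permanently activated mutants noted above.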

The discovery of new cell signaling proteins and the polynucleotides encoding them
satisfies a need in the art by providing new compositions which are useful in the diagnosis,
prevention, and treatment of cell proliferative and inflammatory disorders.

SUMMARY OF THE INVENTION 

The invention features substantially purified polypeptides, cell signaling proteins, referred
to collectively as "CSIGP" and individually as CSIGP-1 through CSIGP-13. In one aspect, the
invention provides a substantially purified polypeptide comprising an amino acid sequence
selected from the group consisting of SEQ ID NO: 1-13, and fragments thereof.

The invention further provides a substantially purified variant having at least 90% amino
acid identity to at least one of the amino acid sequences selected from the group consisting of SEQ
ID NO: 1-13, and fragments thereof. The invention also provides an isolated and purified
polynucleotide encoding the polypeptide comprising an amino acid sequence selected from the
group consisting of SEQ ID NO: 1-13, and fragments thereof. The invention also includes an
isolated and purified polynucleotide variant having at least 70% polynucleotide sequence identity
to the polynucleotide encoding the polypeptide comprising an amino acid sequence selected from
the group consisting of SEQ ID NO: 1-13, and fragments thereof.

Additionally, the invention provides an isolated and purified polynucleotide which
hybridizes under stringent conditions to the polynucleotide encoding the polypeptide comprising
an amino acid sequence selected from the group consisting of SEQ ID NO: 1-13, and fragments
thereof. The invention also provides an isolated and purified polynucleotide having a sequence
which is complementary to the polynucleotide encoding the polypeptide comprising the amino
acid sequence selected from the group consisting of SEQ ID NO: 1-13, and fragments thereof.

WO 99/58558    PCT/US99/10567

The invention also provides an isolated and purified polynucleotide comprising a
polynucleotide sequence selected from the group consisting of SEQ ID NO: 14-26, and fragments
thereof. The invention further provides an isolated and purified polynucleotide variant having at
least 70% polynucleotide sequence identity to the polynucleotide sequence selected from the
group consisting of SEQ ID NO: 14-26, and fragments thereof. The invention also provides an
isolated and purified polynucleotide having a sequence which is complementary to the
polynucleotide comprising a polynucleotide sequence selected from the group consisting of SEQ
ID NO: 14-26, and fragments thereof.

The invention also provides a method for detecting a polynucleotide in a sample
containing nucleic acids, the method comprising the steps of (a) hybridizing the complement of the
polynucleotide sequence to at least one of the polynucleotides of the sample, thereby forming a
hybridization complex; and (b) detecting the hybridization complex, wherein the presence of the
hybridization complex correlates with the presence of a polynucleotide in the sample. In one
aspect, the method further comprises amplifying the polynucleotide prior to hybridization.

The invention further provides an expression vector containing at least a fragment of the
polynucleotide encoding the polypeptide comprising an amino acid sequence selected from the
group consisting of SEQ ID NO: 1-13, and fragments thereof. In another aspect, the expression
vector is contained within a host cell.

The invention also provides a method for producing a polypeptide, the method comprising
the steps of: (a) culturing the host cell containing an expression vector containing at least a
fragment of a polynucleotide under conditions suitable for the expression of the polypeptide; and
(b) recovering the polypeptide from the host cell culture.

The invention also provides a pharmaceutical composition comprising a substantially
purified polypeptide having the amino acid sequence selected from the group consisting of SEQ
ID NO: 1-13, and fragments thereof, in conjunction with a suitable pharmaceutical carrier.

The invention further includes a purified antibody which binds to a polypeptide selected 
from the group consisting of SEQ ID NO: 1-13, and fragments thereof. The invention also 
provides a purified agonist and a purified antagonist to the polypeptide. 

The invention also provides a method for treating or preventing a disorder associated with
decreased expression or activity of CSIGP, the method comprising administering to a subject in
need of such treatment an effective amount of a pharmaceutical composition comprising a
substantially purified polypeptide having the amino acid sequence selected from the group
consisting of SEQ ID NO: 1-13, and fragments thereof, in conjunction with a suitable
pharmaceutical carrier.

The invention also provides a method for treating or preventing a disorder associated with
increased expression or activity of CSIGP, the method comprising administering to a subject in
need of such treatment an effective amount of an antagonist of a polypeptide having an amino acid
sequence selected from the group consisting of SEQ ID NO: 1-13, and fragments thereof, in
conjunction with a suitable pharmaceutical carrier.

BRIEF DESCRIPTION OF THE TABLES 

Table 1 shows nucleotide and polypeptide sequence identification numbers (SEQ ID NO),
clone identification numbers (clone ID), cDNA libraries, and cDNA fragments used to assemble
full-length sequences encoding CSIGP.

Table 2 shows features of each polypeptide sequence including potential motifs, 
homologous sequences, and methods and algorithms used for identification of CSIGP. 

Table 3 shows the tissue-specific expression patterns of each nucleic acid sequence as
determined by northern analysis; diseases, disorders, or conditions associated with these tissues;
and the vector into which each cDNA was cloned.

Table 4 describes the tissues used to construct the cDNA libraries from which Incyte 
cDNA clones encoding CSIGP were isolated. 

Table 5 shows the programs, their descriptions, references, and threshold parameters used 
to analyze CSIGP. 


DESCRIPTION OF THE INVENTION 

Before the present proteins, nucleotide sequences, and methods are described, it is
understood that this invention is not limited to the particular machines, materials and methods
described, as these may vary. It is also to be understood that the terminology used herein is for the
purpose of describing particular embodiments only, and is not intended to limit the scope of the
present invention, which will be limited only by the appended claims.

It must be noted that as used herein and in the appended claims, the singular forms "a,"
"an," and "the" include plural reference unless the context clearly dictates otherwise. Thus, for
example, a reference to "a host cell" includes a plurality of such host cells, and a reference to "an
antibody" is a reference to one or more antibodies and equivalents thereof known to those skilled
in the art, and so forth.

Unless defined otherwise, all technical and scientific terms used herein have the same
meanings as commonly understood by one of ordinary skill in the art to which this invention
belongs. Although any machines, materials, and methods similar or equivalent to those described
herein can be used to practice or test the present invention, the preferred machines, materials and
methods are now described. All publications mentioned herein are cited for the purpose of
describing and disclosing the cell lines, protocols, reagents and vectors which are reported in the
publications and which might be used in connection with the invention. Nothing herein is to be
construed as an admission that the invention is not entitled to antedate such disclosure by virtue of
prior invention.

DEFINITIONS

"CSIGP" refers to the amino acid sequences of substantially purified CSIGP obtained
from any species, particularly a mammalian species, including bovine, ovine, porcine, murine,
equine, and preferably the human species, from any source, whether natural, synthetic,
semi-synthetic, or recombinant.

The term "agonist" refers to a molecule which, when bound to CSIGP, increases or 
prolongs the duration of the effect of CSIGP. Agonists may include proteins, nucleic acids, 
carbohydrates, or any other molecules which bind to and modulate the effect of CSIGP. 

An "allelic variant" is an alternative form of the gene encoding CSIGP. Allelic variants
may result from at least one mutation in the nucleic acid sequence and may result in altered
mRNAs or in polypeptides whose structure or function may or may not be altered. Any given
natural or recombinant gene may have none, one, or many allelic forms. Common mutational
changes which give rise to allelic variants are generally ascribed to natural deletions, additions, or
substitutions of nucleotides. Each of these types of changes may occur alone, or in combination
with the others, one or more times in a given sequence.

"Altered" nucleic acid sequences encoding CSIGP include those sequences with deletions,
insertions, or substitutions of different nucleotides, resulting in a polynucleotide the same as
CSIGP or a polypeptide with at least one functional characteristic of CSIGP. Included within this
definition are polymorphisms which may or may not be readily detectable using a particular
oligonucleotide probe of the polynucleotide encoding CSIGP, and improper or unexpected
hybridization to allelic variants, with a locus other than the normal chromosomal locus for the
polynucleotide sequence encoding CSIGP. The encoded protein may also be "altered," and may
contain deletions, insertions, or substitutions of amino acid residues which produce a silent change
and result in a functionally equivalent CSIGP. Deliberate amino acid substitutions may be made
on the basis of similarity in polarity, charge, solubility, hydrophobicity, hydrophilicity, and/or the
amphipathic nature of the residues, as long as the biological or immunological activity of CSIGP is
retained. For example, negatively charged amino acids may include aspartic acid and glutamic
acid, positively charged amino acids may include lysine and arginine, and amino acids with
uncharged polar head groups having similar hydrophilicity values may include leucine, isoleucine,
and valine; glycine and alanine; asparagine and glutamine; serine and threonine; and
phenylalanine and tyrosine.
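
The residue groupings listed above can be expressed as a simple lookup. The following Python sketch is illustrative only: it assumes standard one-letter amino acid codes, takes its groups directly from the preceding sentence, and uses a hypothetical helper name, `is_conservative`. It does not test whether biological or immunological activity is retained, which the definition also requires.

```python
# Residue groupings as listed in the text, written in one-letter codes.
# The constant and function names are hypothetical, chosen for illustration.
CONSERVATIVE_GROUPS = [
    {"D", "E"},       # negatively charged: aspartic acid, glutamic acid
    {"K", "R"},       # positively charged: lysine, arginine
    {"L", "I", "V"},  # leucine, isoleucine, valine
    {"G", "A"},       # glycine, alanine
    {"N", "Q"},       # asparagine, glutamine
    {"S", "T"},       # serine, threonine
    {"F", "Y"},       # phenylalanine, tyrosine
]


def is_conservative(a: str, b: str) -> bool:
    """Return True if residues a and b are identical or share a listed group."""
    a, b = a.upper(), b.upper()
    if a == b:
        return True
    return any(a in group and b in group for group in CONSERVATIVE_GROUPS)
```

For example, `is_conservative("D", "E")` is true because both residues fall in the negatively charged group, while `is_conservative("D", "K")` is false.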

The terms "amino acid" or "amino acid sequence" refer to an oligopeptide, peptide,
polypeptide, or protein sequence, or a fragment of any of these, and to naturally occurring or
synthetic molecules. In this context, "fragments," "immunogenic fragments," or "antigenic
fragments" refer to fragments of CSIGP which are preferably at least 5 to about 15 amino acids in
length, most preferably at least 14 amino acids, and which retain some biological activity or
immunological activity of CSIGP. Where "amino acid sequence" is recited to refer to an amino
acid sequence of a naturally occurring protein molecule, "amino acid sequence" and like terms are
not meant to limit the amino acid sequence to the complete native amino acid sequence associated
with the recited protein molecule.

"Amplification" relates to the production of additional copies of a nucleic acid sequence.
Amplification is generally carried out using polymerase chain reaction (PCR) technologies well
known in the art.

The term "antagonist" refers to a molecule which, when bound to CSIGP, decreases the
amount or the duration of the effect of the biological or immunological activity of CSIGP.
Antagonists may include proteins, nucleic acids, carbohydrates, antibodies, or any other molecules
which decrease the effect of CSIGP.

The term "antibody" refers to intact molecules as well as to fragments thereof, such as
Fab, F(ab')2, and Fv fragments, which are capable of binding the epitopic determinant. Antibodies
that bind CSIGP polypeptides can be prepared using intact polypeptides or using fragments
containing small peptides of interest as the immunizing antigen. The polypeptide or oligopeptide
used to immunize an animal (e.g., a mouse, a rat, or a rabbit) can be derived from the translation of
RNA, or synthesized chemically, and can be conjugated to a carrier protein if desired. Commonly
used carriers that are chemically coupled to peptides include bovine serum albumin, thyroglobulin,
and keyhole limpet hemocyanin (KLH). The coupled peptide is then used to immunize the animal.

The term "antigenic determinant" refers to that fragment of a molecule (i.e., an epitope)
that makes contact with a particular antibody. When a protein or a fragment of a protein is used to
immunize a host animal, numerous regions of the protein may induce the production of antibodies
which bind specifically to antigenic determinants (given regions or three-dimensional structures on
the protein). An antigenic determinant may compete with the intact antigen (i.e., the immunogen
used to elicit the immune response) for binding to an antibody.

The term "antisense" refers to any composition containing a nucleic acid sequence which
is complementary to the "sense" strand of a specific nucleic acid sequence. Antisense molecules
may be produced by any method including synthesis or transcription. Once introduced into a cell,
the complementary nucleotides combine with natural sequences produced by the cell to form
duplexes and to block either transcription or translation. The designation "negative" can refer to
the antisense strand, and the designation "positive" can refer to the sense strand.

The term "biologically active" refers to a protein having structural, regulatory, or
biochemical functions of a naturally occurring molecule. Likewise, "immunologically active"
refers to the capability of the natural, recombinant, or synthetic CSIGP, or of any oligopeptide
thereof, to induce a specific immune response in appropriate animals or cells and to bind with
specific antibodies.

The terms "complementary" or "complementarity" refer to the natural binding of
polynucleotides by base pairing. For example, the sequence "5' A-G-T 3'" bonds to the
complementary sequence "3' T-C-A 5'." Complementarity between two single-stranded molecules
may be "partial," such that only some of the nucleic acids bind, or it may be "complete," such that
total complementarity exists between the single-stranded molecules. The degree of
complementarity between nucleic acid strands has significant effects on the efficiency and strength
of the hybridization between the nucleic acid strands. This is of particular importance in
amplification reactions, which depend upon binding between nucleic acid strands, and in the
design and use of peptide nucleic acid (PNA) molecules.
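
The base-pairing example above, and the distinction between partial and complete complementarity, can be sketched in Python. This is a minimal illustration, assuming DNA bases A, C, G, and T only and strands of equal length aligned position by position; the function names `complement_3to5` and `complementarity` are hypothetical and do not refer to any program named in this document.

```python
# Watson-Crick pairing for DNA bases (assumption: no RNA or ambiguity codes).
PAIRS = {"A": "T", "T": "A", "G": "C", "C": "G"}


def complement_3to5(seq_5to3: str) -> str:
    """Return the complementary strand, written 3' to 5', base by base."""
    return "".join(PAIRS[base] for base in seq_5to3.upper())


def complementarity(strand_5to3: str, other_3to5: str) -> float:
    """Fraction of aligned positions that form a Watson-Crick pair.

    1.0 corresponds to "complete" complementarity; values between 0 and 1
    correspond to "partial" complementarity.
    """
    if not strand_5to3 or len(strand_5to3) != len(other_3to5):
        raise ValueError("strands must be non-empty and of equal length")
    paired = sum(
        PAIRS.get(a) == b
        for a, b in zip(strand_5to3.upper(), other_3to5.upper())
    )
    return paired / len(strand_5to3)
```

With the sequences from the text, `complement_3to5("AGT")` gives `"TCA"`, and `complementarity("AGT", "TCA")` evaluates to 1.0; changing one base of the second strand lowers the value, giving a partial match.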

A "composition comprising a given polynucleotide sequence" or a "composition
comprising a given amino acid sequence" refers broadly to any composition containing the given
polynucleotide or amino acid sequence. The composition may comprise a dry formulation or an
aqueous solution. Compositions comprising polynucleotide sequences encoding CSIGP or
fragments of CSIGP may be employed as hybridization probes. The probes may be stored in
freeze-dried form and may be associated with a stabilizing agent such as a carbohydrate. In
hybridizations, the probe may be deployed in an aqueous solution containing salts (e.g., NaCl),
detergents (e.g., sodium dodecyl sulfate; SDS), and other components (e.g., Denhardt's solution,
dry milk, salmon sperm DNA, etc.).

"Consensus sequence" refers to a nucleic acid sequence which has been resequenced to
resolve uncalled bases, extended using XL-PCR kit (Perkin-Elmer, Norwalk CT) in the 5' and/or
the 3' direction, and resequenced, or which has been assembled from the overlapping sequences of
more than one Incyte Clone using a computer program for fragment assembly, such as the
GELVIEW Fragment Assembly system (GCG, Madison WI). Some sequences have been both
extended and assembled to produce the consensus sequence.

The term "correlates with expression of a polynucleotide" indicates that the detection of
the presence of nucleic acids, the same or related to a nucleic acid sequence encoding CSIGP, by
northern analysis is indicative of the presence of nucleic acids encoding CSIGP in a sample, and
thereby correlates with expression of the transcript from the polynucleotide encoding CSIGP.

A "deletion" refers to a change in the amino acid or nucleotide sequence that results in the
absence of one or more amino acid residues or nucleotides.

The term "derivative" refers to the chemical modification of a polypeptide sequence, or a
polynucleotide sequence. Chemical modifications of a polynucleotide sequence can include, for
example, replacement of hydrogen by an alkyl, acyl, or amino group. A derivative polynucleotide
encodes a polypeptide which retains at least one biological or immunological function of the
natural molecule. A derivative polypeptide is one modified by glycosylation, pegylation, or any
similar process that retains at least one biological or immunological function of the polypeptide
from which it was derived.

The term "similarity" refers to a degree of complementarity. There may be partial
similarity or complete similarity. The word "identity" may substitute for the word "similarity." A
partially complementary sequence that at least partially inhibits an identical sequence from
hybridizing to a target nucleic acid is referred to as "substantially similar." The inhibition of
hybridization of the completely complementary sequence to the target sequence may be examined
using a hybridization assay (Southern or northern blot, solution hybridization, and the like) under
conditions of reduced stringency. A substantially similar sequence or hybridization probe will
compete for and inhibit the binding of a completely similar (identical) sequence to the target
sequence under conditions of reduced stringency. This is not to say that conditions of reduced
stringency are such that non-specific binding is permitted, as reduced stringency conditions
require that the binding of two sequences to one another be a specific (i.e., a selective) interaction.
The absence of non-specific binding may be tested by the use of a second target sequence which
lacks even a partial degree of complementarity (e.g., less than about 30% similarity or identity).
In the absence of non-specific binding, the substantially similar sequence or probe will not
hybridize to the second non-complementary target sequence.

The phrases "percent identity" or "% identity" refer to the percentage of sequence 
similarity found in a comparison of two or more amino acid or nucleic acid sequences. Percent 
identity can be determined electronically, e.g., by using the MEGALIGN program (DNASTAR, 
Madison WI). The MEGALIGN program can create alignments between two or more sequences 
according to different methods, e.g., the clustal method. (See, e.g., Higgins, D.G. and P.M. Sharp 
(1988) Gene 73:237-244.) The clustal algorithm groups sequences into clusters by examining the 
distances between all pairs. The clusters are aligned pairwise and then in groups. The percentage 
similarity between two amino acid sequences, e.g., sequence A and sequence B, is calculated by 
dividing the length of sequence A, minus the number of gap residues in sequence A, minus the 
number of gap residues in sequence B, into the sum of the residue matches between sequence A 
and sequence B, times one hundred. Gaps of low or of no similarity between the two amino acid 
sequences are not included in determining percentage similarity. Percent identity between nucleic 
acid sequences can also be counted or calculated by other methods known in the art, e.g., the Jotun 
Hein method. (See, e.g., Hein, J. (1990) Methods Enzymol. 183:626-645.) Identity between 
sequences can also be determined by other methods known in the art, e.g., by varying 
hybridization conditions. 
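
The percentage similarity calculation described above can be sketched in a few lines of Python. This is an illustrative sketch only, not the MEGALIGN implementation: it assumes the two sequences have already been aligned to equal length, that gap residues are written as "-", and that "the length of sequence A" means the length of its aligned row. The function name percent_similarity is hypothetical.

```python
def percent_similarity(aligned_a: str, aligned_b: str, gap: str = "-") -> float:
    """Percent similarity of two pre-aligned sequences of equal length.

    Follows the verbal formula in the text:
        100 * matches / (length of A - gaps in A - gaps in B)
    Assumption: "length of A" is the aligned length, gaps included.
    """
    if len(aligned_a) != len(aligned_b):
        raise ValueError("aligned sequences must have the same length")

    gaps_a = aligned_a.count(gap)
    gaps_b = aligned_b.count(gap)

    # A match is an identical, non-gap residue in the same alignment column.
    matches = sum(
        1 for x, y in zip(aligned_a, aligned_b) if x == y and x != gap
    )

    denominator = len(aligned_a) - gaps_a - gaps_b
    if denominator <= 0:
        raise ValueError("no comparable residues after removing gaps")
    return 100.0 * matches / denominator


if __name__ == "__main__":
    # One mismatch in six ungapped columns.
    print(round(percent_similarity("ACDEFG", "ACDQFG"), 2))
    # Gapped columns are excluded from the denominator.
    print(percent_similarity("ACDE-G", "AC-EFG"))
```

Because gapped columns are removed from the denominator, two sequences that agree at every column where both have a residue score 100% under this reading, however many gaps the alignment introduced.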

"Human artificial chromosomes" (HACs) are linear microchromosomes which may 
contain DNA sequences of about 6 kb to 10 Mb in size, and which contain all of the elements 
required for stable mitotic chromosome segregation and maintenance. 

The term "humanized antibody" refers to antibody molecules in which the amino acid 
sequence in the non-antigen binding regions has been altered so that the antibody more closely 
resembles a human antibody, and still retains its original binding ability. 

"Hybridization" refers to any process by which a strand of nucleic acid binds with a 
complementary strand through base pairing. 

The term "hybridization complex" refers to a complex formed between two nucleic acid 
sequences by virtue of the formation of hydrogen bonds between complementary bases. A 
hybridization complex may be formed in solution (e.g., Cot or Rot analysis) or formed between one 
nucleic acid sequence present in solution and another nucleic acid sequence immobilized on a 
solid support (e.g., paper, membranes, filters, chips, pins or glass slides, or any other appropriate 
substrate to which cells or their nucleic acids have been fixed). 

The words "insertion" or "addition" refer to changes in an amino acid or nucleotide 
sequence resulting in the addition of one or more amino acid residues or nucleotides, respectively, 
to the sequence found in the naturally occurring molecule. 

"Immune response" can refer to conditions associated with inflammation, trauma, immune 
disorders, or infectious or genetic disease, etc. These conditions can be characterized by 
expression of various factors, e.g., cytokines, chemokines, and other signaling molecules, which 
may affect cellular and systemic defense systems. 

The term "microarray" refers to an arrangement of distinct polynucleotides on a substrate. 

The terms "element" or "array element" in a microarray context refer to hybridizable 
polynucleotides arranged on the surface of a substrate. 

The term "modulate" refers to a change in the activity of CSIGP. For example, 
modulation may cause an increase or a decrease in protein activity, binding characteristics, or any 
other biological, functional, or immunological properties of CSIGP. 

The phrases "nucleic acid" or "nucleic acid sequence" refer to a nucleotide, 
oligonucleotide, polynucleotide, or any fragment thereof. These phrases also refer to DNA or 
RNA of genomic or synthetic origin which may be single-stranded or double-stranded and may 
represent the sense or the antisense strand, to peptide nucleic acid (PNA), or to any DNA-like or 
RNA-like material. In this context, "fragments" refers to those nucleic acid sequences which, 
when translated, would produce polypeptides retaining some functional characteristic, e.g., 
antigenicity, or structural domain characteristic, e.g., ATP-binding site, of the full-length 
polypeptide. 

The terms "operably associated" or "operably linked" refer to functionally related nucleic 
acid sequences. A promoter is operably associated or operably linked with a coding sequence if 
the promoter controls the translation of the encoded polypeptide. While operably associated or 
operably linked nucleic acid sequences can be contiguous and in the same reading frame, certain 
genetic elements, e.g., repressor genes, are not contiguously linked to the sequence encoding the 
polypeptide but still bind to operator sequences that control expression of the polypeptide. 

The term "oligonucleotide" refers to a nucleic acid sequence of at least about 6 
nucleotides to 60 nucleotides, preferably about 15 to 30 nucleotides, and most preferably about 20 
to 25 nucleotides, which can be used in PCR amplification or in a hybridization assay or 
microarray. "Oligonucleotide" is substantially equivalent to the terms "amplimer," "primer," 
"oligomer," and "probe," as these terms are commonly defined in the art. 

"Peptide nucleic acid" (PNA) refers to an antisense molecule or anti-gene agent which 
comprises an oligonucleotide of at least about 5 nucleotides in length linked to a peptide backbone 
of amino acid residues ending in lysine. The terminal lysine confers solubility to the composition. 
PNAs preferentially bind complementary single stranded DNA or RNA and stop transcript 
elongation, and may be pegylated to extend their lifespan in the cell. 

The term "sample" is used in its broadest sense. A sample suspected of containing nucleic 
acids encoding CSIGP, or fragments thereof, or CSIGP itself, may comprise a bodily fluid; an 
extract from a cell, chromosome, organelle, or membrane isolated from a cell; a cell; genomic 
DNA, RNA, or cDNA, in solution or bound to a substrate; a tissue; a tissue print; etc. 

The terms "specific binding" or "specifically binding" refer to that interaction between a 
protein or peptide and an agonist, an antibody, or an antagonist. The interaction is dependent upon 
the presence of a particular structure of the protein, e.g., the antigenic determinant or epitope, 
recognized by the binding molecule. For example, if an antibody is specific for epitope "A," the 
presence of a polypeptide containing the epitope A, or the presence of free unlabeled A, in a 
reaction containing free labeled A and the antibody will reduce the amount of labeled A that binds 
to the antibody. 

The term "stringent conditions" refers to conditions which permit hybridization between 
polynucleotides and the claimed polynucleotides. Stringent conditions can be defined by salt 
concentration, the concentration of organic solvent, e.g., formamide, temperature, and other 
conditions well known in the art. In particular, stringency can be increased by reducing the 
concentration of salt, increasing the concentration of formamide, or raising the hybridization 
temperature. 

The term "substantially purified" refers to nucleic acid or amino acid sequences that are 
removed from their natural environment and are isolated or separated, and are at least about 60% 
free, preferably about 75% free, and most preferably about 90% free from other components with 
which they are naturally associated. 

A "substitution" refers to the replacement of one or more amino acids or nucleotides by 
different amino acids or nucleotides, respectively. 

"Substrate" refers to any suitable rigid or semi-rigid support including membranes, filters, 
chips, slides, wafers, fibers, magnetic or nonmagnetic beads, gels, tubing, plates, polymers, 
microparticles and capillaries. The substrate can have a variety of surface forms, such as wells, 
trenches, pins, channels and pores, to which polynucleotides or polypeptides are bound. 

"Transformation" describes a process by which exogenous DNA enters and changes a 
recipient cell. Transformation may occur under natural or artificial conditions according to 
various methods well known in the art, and may rely on any known method for the insertion of 
foreign nucleic acid sequences into a prokaryotic or eukaryotic host cell. The method for 
transformation is selected based on the type of host cell being transformed and may include, but is 
not limited to, viral infection, electroporation, heat shock, lipofection, and particle bombardment. 
The term "transformed" cells includes stably transformed cells in which the inserted DNA is 
capable of replication either as an autonomously replicating plasmid or as part of the host 
chromosome, as well as transiently transformed cells which express the inserted DNA or RNA for 
limited periods of time. 

A "variant" of CSIGP polypeptides refers to an amino acid sequence that is altered by one 
or more amino acid residues. The variant may have "conservative" changes, wherein a substituted 
amino acid has similar structural or chemical properties (e.g., replacement of leucine with 
isoleucine). More rarely, a variant may have "nonconservative" changes (e.g., replacement of 
glycine with tryptophan). Analogous minor variations may also include amino acid deletions or 
insertions, or both. Guidance in determining which amino acid residues may be substituted, 
inserted, or deleted without abolishing biological or immunological activity may be found using 
computer programs well known in the art, for example, LASERGENE software (DNASTAR). 

The term "variant," when used in the context of a polynucleotide sequence, may 
encompass a polynucleotide sequence related to CSIGP. This definition may also include, for 
example, "allelic" (as defined above), "splice," "species," or "polymorphic" variants. A splice 
variant may have significant identity to a reference molecule, but will generally have a greater or 
lesser number of polynucleotides due to alternate splicing of exons during mRNA processing. The 
corresponding polypeptide may possess additional functional domains or an absence of domains. 
Species variants are polynucleotide sequences that vary from one species to another. The resulting 
polypeptides generally will have significant amino acid identity relative to each other. A 
polymorphic variant is a variation in the polynucleotide sequence of a particular gene between 
individuals of a given species. Polymorphic variants also may encompass "single nucleotide 
polymorphisms" (SNPs) in which the polynucleotide sequence varies by one base. The presence 
of SNPs may be indicative of, for example, a certain population, a disease state, or a propensity for 
a disease state. 

THE INVENTION 

The invention is based on the discovery of new human cell signaling proteins (CSIGP), 
the polynucleotides encoding CSIGP, and the use of these compositions for the diagnosis, 
treatment, or prevention of cell proliferative and inflammatory disorders. 

Table 1 lists the Incyte Clones used to derive full length nucleotide sequences encoding 
CSIGP. Columns 1 and 2 show the sequence identification numbers (SEQ ID NO) of the amino 
acid and nucleic acid sequences, respectively. Column 3 shows the Clone ID of the Incyte Clone 
in which nucleic acids encoding each CSIGP were first identified, and column 4, the cDNA 
libraries from which these clones were isolated. Column 5 shows Incyte clones, their 
corresponding cDNA libraries, and shotgun sequences useful as fragments in hybridization 
technologies, and which are part of the consensus nucleotide sequence of each CSIGP. 

The columns of Table 2 show various properties of the polypeptides of the invention: 
column 1 references the SEQ ID NO; column 2 shows the number of amino acid residues in each 
polypeptide; column 3, potential phosphorylation sites; column 4, potential glycosylation sites; 
column 5, the amino acid residues comprising signature sequences and motifs; column 6, 
homologous sequences; and column 7, analytical methods used to identify each protein through 
sequence homology and protein motifs. 

The columns of Table 3 show the tissue-specificity and disease-association of nucleotide 
sequences encoding CSIGP. The first column of Table 3 lists the polynucleotide sequence 
identifiers. The second column lists tissue categories which express CSIGP as a fraction of total 
tissue categories expressing CSIGP. The third column lists diseases, disorders, and conditions 
associated with those tissues expressing CSIGP. The fourth column lists the vectors used to 
subclone the cDNA library. 

The following fragments of the nucleotide sequences encoding CSIGP are useful in 
hybridization or amplification technologies to identify SEQ ID NO:14-26 and to distinguish 
between SEQ ID NO:14-26 and similar polynucleotide sequences. The useful fragments are the 
fragment of SEQ ID NO:14 from about nucleotide 135 to about nucleotide 189, the 
fragment of SEQ ID NO:15 from about nucleotide 493 to about nucleotide 558, the 
fragment of SEQ ID NO:16 from about nucleotide 1170 to about nucleotide 1233, the 
fragment of SEQ ID NO:17 from about nucleotide 939 to about nucleotide 996, the 
fragment of SEQ ID NO:18 from about nucleotide 424 to about nucleotide 486, the 
fragment of SEQ ID NO:19 from about nucleotide 274 to about nucleotide 333, the 
fragment of SEQ ID NO:20 from about nucleotide 1013 to about nucleotide 1070, the 
fragment of SEQ ID NO:21 from about nucleotide 284 to about nucleotide 325, the 
fragment of SEQ ID NO:22 from about nucleotide 642 to about nucleotide 674, the 
fragment of SEQ ID NO:23 from about nucleotide 742 to about nucleotide 769, the 
fragment of SEQ ID NO:24 from about nucleotide 457 to about nucleotide 486, the 
fragment of SEQ ID NO:25 from about nucleotide 205 to about nucleotide 246, and the 
fragment of SEQ ID NO:26 from about nucleotide 319 to about nucleotide 342. 
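
For readers who want to work with these coordinates programmatically, the fragment boundaries listed above can be collected into a lookup table. The sketch below is illustrative only: the names USEFUL_FRAGMENTS, fragment_length, and extract_fragment are hypothetical, the coordinates are treated as 1-based and inclusive (an assumption, since the text gives them only as "about" positions), and the demonstration sequence is a placeholder rather than any sequence of the Sequence Listing.

```python
# SEQ ID NO -> (start, end) nucleotide positions, copied from the text above.
# Assumption: positions are 1-based and inclusive; the text says "about".
USEFUL_FRAGMENTS = {
    14: (135, 189),
    15: (493, 558),
    16: (1170, 1233),
    17: (939, 996),
    18: (424, 486),
    19: (274, 333),
    20: (1013, 1070),
    21: (284, 325),
    22: (642, 674),
    23: (742, 769),
    24: (457, 486),
    25: (205, 246),
    26: (319, 342),
}


def fragment_length(seq_id: int) -> int:
    """Length in nucleotides of the listed fragment for a SEQ ID NO."""
    start, end = USEFUL_FRAGMENTS[seq_id]
    return end - start + 1


def extract_fragment(sequence: str, start: int, end: int) -> str:
    """Return the 1-based, inclusive slice [start, end] of a sequence."""
    if start < 1 or end > len(sequence) or start > end:
        raise ValueError("fragment coordinates fall outside the sequence")
    return sequence[start - 1:end]


if __name__ == "__main__":
    for seq_id in sorted(USEFUL_FRAGMENTS):
        print(f"SEQ ID NO:{seq_id}: {fragment_length(seq_id)} nt")
    # Placeholder sequence, used only to show the slicing convention.
    print(extract_fragment("ACGTACGT", 2, 4))
```

Under this inclusive convention the listed fragments range from 24 to 66 nucleotides in length.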

The invention also encompasses CSIGP variants. A preferred CSIGP variant is one which 
has at least about 80%, more preferably at least about 90%, and most preferably at least about 95% 
amino acid sequence identity to the CSIGP amino acid sequence, and which contains at least one 
functional or structural characteristic of CSIGP. 

The invention also encompasses polynucleotides which encode CSIGP. In a particular 
embodiment, the invention encompasses a polynucleotide sequence comprising a sequence 
selected from the group consisting of SEQ ID NO:14-26 which encodes CSIGP. 

The invention also encompasses a variant of a polynucleotide sequence encoding CSIGP. 
In particular, such a variant polynucleotide sequence will have at least about 70%, more preferably 
at least about 85%, and most preferably at least about 95% polynucleotide sequence identity to the 
polynucleotide sequence encoding CSIGP. A particular aspect of the invention encompasses a 
variant of a polynucleotide sequence comprising a sequence selected from the group consisting of 
SEQ ID NO:14-26 which has at least about 70%, more preferably at least about 85%, and most 
preferably at least about 95% polynucleotide sequence identity to a nucleic acid sequence selected 
from the group consisting of SEQ ID NO:14-26. Any one of the polynucleotide variants described 
above can encode an amino acid sequence which contains at least one functional or structural 
characteristic of CSIGP. 

It will be appreciated by those skilled in the art that as a result of the degeneracy of the 
genetic code, a multitude of polynucleotide sequences encoding CSIGP, some bearing minimal 
similarity to the polynucleotide sequences of any known and naturally occurring gene, may be 
produced. Thus, the invention contemplates each and every possible variation of polynucleotide 
sequence that could be made by selecting combinations based on possible codon choices. These 
combinations are made in accordance with the standard triplet genetic code as applied to the 
polynucleotide sequence of naturally occurring CSIGP, and all such variations are to be 
considered as being specifically disclosed. 

Although nucleotide sequences which encode CSIGP and its variants are preferably 
capable of hybridizing to the nucleotide sequence of the naturally occurring CSIGP under 
appropriately selected conditions of stringency, it may be advantageous to produce nucleotide 
sequences encoding CSIGP or its derivatives possessing a substantially different codon usage, 
e.g., inclusion of non-naturally occurring codons. Codons may be selected to increase the rate at 
which expression of the peptide occurs in a particular prokaryotic or eukaryotic host in accordance 
with the frequency with which particular codons are utilized by the host. Other reasons for 
substantially altering the nucleotide sequence encoding CSIGP and its derivatives without altering 
the encoded amino acid sequences include the production of RNA transcripts having more 
desirable properties, such as a greater half-life, than transcripts produced from the naturally 
occurring sequence. 
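
The two ideas above, that many polynucleotides encode the same polypeptide and that codons may be chosen according to host usage, can be illustrated with a short Python sketch. Everything in it is hypothetical: the codon table covers only three amino acids and a stop signal, the frequencies are invented placeholders rather than measured values for any host, and the function names are not taken from any library.

```python
from math import prod

# Hypothetical codon usage for an unnamed host: amino acid -> {codon: frequency}.
# The frequencies are illustrative placeholders, not measured data.
CODON_USAGE = {
    "M": {"ATG": 1.00},
    "K": {"AAA": 0.74, "AAG": 0.26},
    "L": {"CTG": 0.47, "TTA": 0.14, "TTG": 0.13,
          "CTT": 0.12, "CTC": 0.10, "CTA": 0.04},
    "*": {"TAA": 0.61, "TGA": 0.30, "TAG": 0.09},
}


def count_encodings(peptide: str, usage=CODON_USAGE) -> int:
    """Number of distinct nucleotide sequences encoding the peptide."""
    return prod(len(usage[aa]) for aa in peptide)


def back_translate(peptide: str, usage=CODON_USAGE) -> str:
    """Encode a peptide using the most frequent codon for each residue."""
    codons = []
    for aa in peptide:
        if aa not in usage:
            raise KeyError(f"no codon usage entry for residue {aa!r}")
        table = usage[aa]
        codons.append(max(table, key=table.get))
    return "".join(codons)


if __name__ == "__main__":
    print(count_encodings("MKL"))   # 1 * 2 * 6 synonymous encodings
    print(back_translate("MKL*"))
```

Even this three-residue example has twelve synonymous encodings, which shows how quickly the number of possible coding sequences grows with polypeptide length.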

The invention also encompasses production of DNA sequences which encode CSIGP and 
CSIGP derivatives, or fragments thereof, entirely by synthetic chemistry. After production, the 
synthetic sequence may be inserted into any of the many available expression vectors and cell 
systems using reagents well known in the art. Moreover, synthetic chemistry may be used to 
introduce mutations into a sequence encoding CSIGP or any fragment thereof. 

Also encompassed by the invention are polynucleotide sequences that are capable of 
hybridizing to the claimed polynucleotide sequences, and, in particular, to those shown in SEQ ID 
NO:14-26 and fragments thereof under various conditions of stringency. (See, e.g., Wahl, G.M. 
and S.L. Berger (1987) Methods Enzymol. 152:399-407; Kimmel, A.R. (1987) Methods Enzymol. 
152:507-511.) For example, stringent salt concentration will ordinarily be less than about 750 mM 
NaCl and 75 mM trisodium citrate, preferably less than about 500 mM NaCl and 50 mM trisodium 
citrate, and most preferably less than about 250 mM NaCl and 25 mM trisodium citrate. Low 
stringency hybridization can be obtained in the absence of organic solvent, e.g., formamide, while 
high stringency hybridization can be obtained in the presence of at least about 35% formamide, 
and most preferably at least about 50% formamide. Stringent temperature conditions will 
ordinarily include temperatures of at least about 30°C, more preferably of at least about 37°C, and 
most preferably of at least about 42°C. Varying additional parameters, such as hybridization time, 
the concentration of detergent, e.g., sodium dodecyl sulfate (SDS), and the inclusion or exclusion 
of carrier DNA, are well known to those skilled in the art. Various levels of stringency are 
accomplished by combining these various conditions as needed. In a preferred embodiment, 
hybridization will occur at 30°C in 750 mM NaCl, 75 mM trisodium citrate, and 1% SDS. In a 
more preferred embodiment, hybridization will occur at 37°C in 500 mM NaCl, 50 mM trisodium 
citrate, 1% SDS, 35% formamide, and 100 µg/ml denatured salmon sperm DNA (ssDNA). In a 
most preferred embodiment, hybridization will occur at 42°C in 250 mM NaCl, 25 mM trisodium 
citrate, 1% SDS, 50% formamide, and 200 µg/ml ssDNA. Useful variations on these conditions 
will be readily apparent to those skilled in the art. 

The washing steps which follow hybridization can also vary in stringency. Wash 
stringency conditions can be defined by salt concentration and by temperature. As above, wash 
stringency can be increased by decreasing salt concentration or by increasing temperature. For 
example, stringent salt concentration for the wash steps will preferably be less than about 30 mM 
NaCl and 3 mM trisodium citrate, and most preferably less than about 15 mM NaCl and 1.5 mM 
trisodium citrate. Stringent temperature conditions for the wash steps will ordinarily include 
temperature of at least about 25°C, more preferably of at least about 42°C, and most preferably of 
at least about 68°C. In a preferred embodiment, wash steps will occur at 25°C in 30 mM NaCl, 3 
mM trisodium citrate, and 0.1% SDS. In a more preferred embodiment, wash steps will occur at 
42°C in 15 mM NaCl, 1.5 mM trisodium citrate, and 0.1% SDS. In a most preferred embodiment, 
wash steps will occur at 68°C in 15 mM NaCl, 1.5 mM trisodium citrate, and 0.1% SDS. 
Additional variations on these conditions will be readily apparent to those skilled in the art. 
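
The preferred, more preferred, and most preferred hybridization and wash embodiments described above can be tabulated and compared as in the Python sketch below. It is illustrative only. The class and function names are hypothetical, and two simplifications are assumptions rather than statements of the text: embodiments that do not mention formamide or ssDNA are recorded with a value of zero, and one condition is treated as at least as stringent as another when its temperature is not lower, its NaCl concentration is not higher, and its formamide concentration is not lower.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Condition:
    temp_c: float                  # temperature in degrees Celsius
    nacl_mm: float                 # NaCl concentration, mM
    citrate_mm: float              # trisodium citrate concentration, mM
    sds_pct: float                 # SDS, percent
    formamide_pct: float = 0.0     # assumed 0 when not stated in the text
    ssdna_ug_per_ml: float = 0.0   # assumed 0 when not stated in the text


# Preferred, more preferred, most preferred (values copied from the text).
HYBRIDIZATION = [
    Condition(30, 750, 75, 1.0),
    Condition(37, 500, 50, 1.0, 35, 100),
    Condition(42, 250, 25, 1.0, 50, 200),
]
WASH = [
    Condition(25, 30, 3, 0.1),
    Condition(42, 15, 1.5, 0.1),
    Condition(68, 15, 1.5, 0.1),
]


def at_least_as_stringent(a: Condition, b: Condition) -> bool:
    """True if a is no less stringent than b on every compared parameter."""
    return (
        a.temp_c >= b.temp_c
        and a.nacl_mm <= b.nacl_mm
        and a.formamide_pct >= b.formamide_pct
    )


if __name__ == "__main__":
    for name, tiers in (("hybridization", HYBRIDIZATION), ("wash", WASH)):
        ordered = all(
            at_least_as_stringent(tiers[i + 1], tiers[i])
            for i in range(len(tiers) - 1)
        )
        print(name, "tiers increase in stringency:", ordered)
```

The comparison is a partial order: two conditions that trade a higher temperature against a higher salt concentration are simply not comparable under this rule, which matches the statement that various levels of stringency are reached by combining conditions as needed.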

Methods for DNA sequencing are well known in the art and may be used to practice any 
of the embodiments of the invention. The methods may employ such enzymes as the Klenow 
fragment of DNA polymerase I, SEQUENASE (US Biochemical, Cleveland OH), Taq polymerase 
(Perkin-Elmer), thermostable T7 polymerase (Amersham Pharmacia Biotech, Piscataway NJ), or 
combinations of polymerases and proofreading exonucleases such as those found in the 
ELONGASE amplification system (Life Technologies, Gaithersburg MD). Preferably, sequence 
preparation is automated with machines such as the HYDRA microdispenser (Robbins Scientific, 
Sunnyvale CA), MICROLAB 2200 (Hamilton, Reno NV), Peltier Thermal Cycler 200 (PTC200; 
MJ Research, Watertown MA) and the ABI CATALYST 800 (Perkin-Elmer). Sequencing is then 
carried out using either ABI 373 or 377 DNA Sequencing Systems (Perkin-Elmer) or the 
MEGABACE 1000 DNA sequencing system (Molecular Dynamics, Sunnyvale CA). The 
resulting sequences are analyzed using a variety of algorithms which are well known in the art. 
(See, e.g., Ausubel, F.M. (1997) Short Protocols in Molecular Biology, John Wiley & Sons, New 
York NY, unit 7.7; Meyers, R.A. (1995) Molecular Biology and Biotechnology, Wiley VCH, New 
York NY, pp. 856-853.) 

The nucleic acid sequences encoding CSIGP may be extended utilizing a partial 
nucleotide sequence and employing various PCR-based methods known in the art to detect 
upstream sequences, such as promoters and regulatory elements. For example, one method which 
may be employed, restriction-site PCR, uses universal and nested primers to amplify unknown 
sequence from genomic DNA within a cloning vector. (See, e.g., Sarkar, G. (1993) PCR Methods 
Applic. 2:318-322.) Another method, inverse PCR, uses primers that extend in divergent 
directions to amplify unknown sequence from a circularized template. The template is derived 
from restriction fragments comprising a known genomic locus and surrounding sequences. (See, 
e.g., Triglia, T. et al. (1988) Nucleic Acids Res. 16:8186.) A third method, capture PCR, involves 
PCR amplification of DNA fragments adjacent to known sequences in human and yeast artificial 
chromosome DNA. (See, e.g., Lagerstrom, M. et al. (1991) PCR Methods Applic. 1:111-119.) In 
this method, multiple restriction enzyme digestions and ligations may be used to insert an 
engineered double-stranded sequence into a region of unknown sequence before performing PCR. 
Other methods which may be used to retrieve unknown sequences are known in the art. (See, e.g., 
Parker, J.D. et al. (1991) Nucleic Acids Res. 19:3055-306.) Additionally, one may use PCR, 
nested primers, and PROMOTERFINDER libraries (Clontech, Palo Alto CA) to walk genomic 
DNA. This procedure avoids the need to screen libraries and is useful in finding intron/exon 
junctions. For all PCR-based methods, primers may be designed using commercially available 
software, such as OLIGO 4.06 Primer Analysis software (National Biosciences, Plymouth MN) or 
another appropriate program, to be about 22 to 30 nucleotides in length, to have a GC content of 
about 50% or more, and to anneal to the template at temperatures of about 68°C to 72°C. 
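
The primer design guidelines just stated (about 22 to 30 nucleotides, a GC content of about 50% or more, and annealing at about 68°C to 72°C) can be expressed as a simple screening function. The Python sketch below is an illustration and not a reproduction of the OLIGO software: the function names are hypothetical, the thresholds are applied as hard cutoffs although the text says "about", and the annealing temperature is supplied by the caller rather than predicted, since the text does not specify a melting temperature model.

```python
from typing import Optional


def gc_content(primer: str) -> float:
    """Fraction of G and C bases in a primer (0.0 to 1.0)."""
    seq = primer.upper()
    if not seq:
        raise ValueError("primer must not be empty")
    return (seq.count("G") + seq.count("C")) / len(seq)


def meets_primer_guidelines(
    primer: str, anneal_temp_c: Optional[float] = None
) -> bool:
    """Screen a primer against the length, GC, and annealing guidelines.

    anneal_temp_c is an externally determined annealing temperature;
    when it is None, only length and GC content are checked.
    """
    if not 22 <= len(primer) <= 30:
        return False
    if gc_content(primer) < 0.50:
        return False
    if anneal_temp_c is not None and not 68.0 <= anneal_temp_c <= 72.0:
        return False
    return True


if __name__ == "__main__":
    candidate = "GCGCATATGCGCATATGCGCATAT"  # arbitrary 24-mer, 50% GC
    print(len(candidate), gc_content(candidate))
    print(meets_primer_guidelines(candidate, anneal_temp_c=70.0))
    print(meets_primer_guidelines(candidate, anneal_temp_c=60.0))
```

In practice the annealing temperature would come from the design program or from empirical optimization; the function only checks that a reported value falls in the stated window.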

When screening for full-length cDNAs, it is preferable to use libraries that have been 
size-selected to include larger cDNAs. In addition, random-primed libraries, which often include 
sequences containing the 5' regions of genes, are preferable for situations in which an oligo d(T) 
library does not yield a full-length cDNA. Genomic libraries may be useful for extension of 
sequence into 5' non-transcribed regulatory regions. 

Capillary electrophoresis systems which are commercially available may be used to 
analyze the size or confirm the nucleotide sequence of sequencing or PCR products. In particular, 
capillary sequencing may employ flowable polymers for electrophoretic separation, four different 
nucleotide-specific, laser-stimulated fluorescent dyes, and a charge coupled device camera for 
detection of the emitted wavelengths. Output/light intensity may be converted to electrical signal 
using appropriate software (e.g., GENOTYPER and SEQUENCE NAVIGATOR, Perkin-Elmer), 
and the entire process from loading of samples to computer analysis and electronic data display 
may be computer controlled. Capillary electrophoresis is especially preferable for sequencing 
small DNA fragments which may be present in limited amounts in a particular sample. 

In another embodiment of the invention, polynucleotide sequences or fragments thereof 
which encode CSIGP may be cloned in recombinant DNA molecules that direct expression of 
CSIGP, or fragments or functional equivalents thereof, in appropriate host cells. Due to the 
inherent degeneracy of the genetic code, other DNA sequences which encode substantially the 
same or a functionally equivalent amino acid sequence may be produced and used to express 
CSIGP. 

The nucleotide sequences of the present invention can be engineered using methods 
generally known in the art in order to alter CSIGP-encoding sequences for a variety of purposes 
including, but not limited to, modification of the cloning, processing, and/or expression of the 
gene product. DNA shuffling by random fragmentation and PCR reassembly of gene fragments 
and synthetic oligonucleotides may be used to engineer the nucleotide sequences. For example, 
oligonucleotide-mediated site-directed mutagenesis may be used to introduce mutations that create 
new restriction sites, alter glycosylation patterns, change codon preference, produce splice 
variants, and so forth. 

In another embodiment, sequences encoding CSIGP may be synthesized, in whole or in 
part, using chemical methods well known in the art. (See, e.g., Caruthers, M.H. et al. (1980) Nucl. 
Acids Res. Symp. Ser. 215-223, and Horn, T. et al. (1980) Nucl. Acids Res. Symp. Ser. 225-232.) 
Alternatively, CSIGP itself or a fragment thereof may be synthesized using chemical methods. 
For example, peptide synthesis can be performed using various solid-phase techniques. (See, e.g., 
Roberge, J.Y. et al. (1995) Science 269:202-204.) Automated synthesis may be achieved using 
the ABI 431A Peptide Synthesizer (Perkin-Elmer). Additionally, the amino acid sequence of 
CSIGP, or any part thereof, may be altered during direct synthesis and/or combined with 
sequences from other proteins, or any part thereof, to produce a variant polypeptide. 

The peptide may be substantially purified by preparative high performance liquid 
chromatography. (See, e.g., Chiez, R.M. and F.Z. Regnier (1990) Methods Enzymol. 182:392-421.) 
The composition of the synthetic peptides may be confirmed by amino acid analysis or by 
sequencing. (See, e.g., Creighton, T. (1984) Proteins, Structures and Molecular Properties, WH 
Freeman, New York NY.) 

In order to express a biologically active CSIGP, the nucleotide sequences encoding 
CSIGP or derivatives thereof may be inserted into an appropriate expression vector, i.e., a vector 
which contains the necessary elements for transcriptional and translational control of the inserted 
coding sequence in a suitable host. These elements include regulatory sequences, such as 
enhancers, constitutive and inducible promoters, and 5' and 3' untranslated regions in the vector 
and in polynucleotide sequences encoding CSIGP. Such elements may vary in their strength and 
specificity. Specific initiation signals may also be used to achieve more efficient translation of 
sequences encoding CSIGP. Such signals include the ATG initiation codon and adjacent 
sequences, e.g., the Kozak sequence. In cases where sequences encoding CSIGP and its initiation 
codon and upstream regulatory sequences are inserted into the appropriate expression vector, no 
additional transcriptional or translational control signals may be needed. However, in cases where 
only coding sequence, or a fragment thereof, is inserted, exogenous translational control signals 
including an in-frame ATG initiation codon should be provided by the vector. Exogenous 
translational elements and initiation codons may be of various origins, both natural and synthetic. 
The efficiency of expression may be enhanced by the inclusion of enhancers appropriate for the 
particular host cell system used. (See, e.g., Scharf, D. et al. (1994) Results Probl. Cell Differ. 
20:125-162.) 

Methods which are well known to those skilled in the art may be used to construct 
expression vectors containing sequences encoding CSIGP and appropriate transcriptional and 
translational control elements. These methods include in vitro recombinant DNA techniques, 
synthetic techniques, and in vivo genetic recombination. (See, e.g., Sambrook, J. et al. (1989) 
Molecular Cloning, A Laboratory Manual, Cold Spring Harbor Press, Plainview NY, ch. 4, 8, and 
16-17; Ausubel, F.M. et al. (1995) Current Protocols in Molecular Biology, John Wiley & Sons, 
New York NY, ch. 9, 13, and 16.) 

A variety of expression vector/host systems may be utilized to contain and express 
sequences encoding CSIGP. These include, but are not limited to, microorganisms such as 
bacteria transformed with recombinant bacteriophage, plasmid, or cosmid DNA expression 
vectors; yeast transformed with yeast expression vectors; insect cell systems infected with viral 
expression vectors (e.g., baculovirus); plant cell systems transformed with viral expression vectors 
(e.g., cauliflower mosaic virus, CaMV, or tobacco mosaic virus, TMV) or with bacterial 
expression vectors (e.g., Ti or pBR322 plasmids); or animal cell systems. The invention is not 
limited by the host cell employed. 

In bacterial systems, a number of cloning and expression vectors may be selected
depending upon the use intended for polynucleotide sequences encoding CSIGP. For example,
routine cloning, subcloning, and propagation of polynucleotide sequences encoding CSIGP can be
achieved using a multifunctional E. coli vector such as PBLUESCRIPT (Stratagene, La Jolla CA)
or pSPORT1 plasmid (Life Technologies). Ligation of sequences encoding CSIGP into the
vector's multiple cloning site disrupts the lacZ gene, allowing a colorimetric screening procedure
for identification of transformed bacteria containing recombinant molecules. In addition, these
vectors may be useful for in vitro transcription, dideoxy sequencing, single strand rescue with 
helper phage, and creation of nested deletions in the cloned sequence. (See, e.g., Van Heeke, G. 
and S.M. Schuster (1989) J. Biol. Chem. 264:5503-5509.) When large quantities of CSIGP are 
needed, e.g., for the production of antibodies, vectors which direct high level expression of CSIGP
may be used. For example, vectors containing the strong, inducible T5 or T7 bacteriophage
promoter may be used.

Yeast expression systems may be used for production of CSIGP. A number of vectors 
containing constitutive or inducible promoters, such as alpha factor, alcohol oxidase, and PGH,
may be used in the yeast Saccharomyces cerevisiae or Pichia pastoris. In addition, such vectors
direct either the secretion or intracellular retention of expressed proteins and enable integration of
foreign sequences into the host genome for stable propagation. (See, e.g., Ausubel, 1995, supra;
Grant et al. (1987) Methods Enzymol. 153:516-54; and Scorer, C. A. et al. (1994) Bio/Technology
12:181-184.) 

Plant systems may also be used for expression of CSIGP. Transcription of sequences
encoding CSIGP may be driven by viral promoters, e.g., the 35S and 19S promoters of CaMV used
alone or in combination with the omega leader sequence from TMV (Takamatsu, N. (1987) 
EMBO J. 6:307-311). Alternatively, plant promoters such as the small subunit of RUBISCO or 
heat shock promoters may be used. (See, e.g., Coruzzi, G. et al. (1984) EMBO J. 3:1671-1680; 
Broglie, R. et al. (1984) Science 224:838-843; and Winter, J. et al. (1991) Results Probl. Cell
Differ. 17:85-105.) These constructs can be introduced into plant cells by direct DNA
transformation or pathogen-mediated transfection. (See, e.g., The McGraw Hill Yearbook of
Science and Technology (1992) McGraw Hill, New York NY, pp. 191-196.)

In mammalian cells, a number of viral-based expression systems may be utilized. In cases 
where an adenovirus is used as an expression vector, sequences encoding CSIGP may be ligated
into an adenovirus transcription/translation complex consisting of the late promoter and tripartite
leader sequence. Insertion in a non-essential E1 or E3 region of the viral genome may be used to
obtain infective virus which expresses CSIGP in host cells. (See, e.g., Logan, J. and T. Shenk
(1984) Proc. Natl. Acad. Sci. 81:3655-3659.) In addition, transcription enhancers, such as the 
Rous sarcoma virus (RSV) enhancer, may be used to increase expression in mammalian host cells.
SV40 or EBV-based vectors may also be used for high-level protein expression.

Human artificial chromosomes (HACs) may also be employed to deliver larger fragments 
of DNA than can be contained in and expressed from a plasmid. HACs of about 6 kb to 10 Mb 
are constructed and delivered via conventional delivery methods (liposomes, polycationic amino 
polymers, or vesicles) for therapeutic purposes. (See, e.g., Harrington, J.J. et al. (1997) Nat. Genet.
15:345-355.)

For long term production of recombinant proteins in mammalian systems, stable 
expression of CSIGP in cell lines is preferred. For example, sequences encoding CSIGP can be 
transformed into cell lines using expression vectors which may contain viral origins of replication 
and/or endogenous expression elements and a selectable marker gene on the same or on a separate 
vector. Following the introduction of the vector, cells may be allowed to grow for about 1 to 2
days in enriched media before being switched to selective media. The purpose of the selectable
marker is to confer resistance to a selective agent, and its presence allows growth and recovery of 
cells which successfully express the introduced sequences. Resistant clones of stably transformed 
cells may be propagated using tissue culture techniques appropriate to the cell type. 

Any number of selection systems may be used to recover transformed cell lines. These
include, but are not limited to, the herpes simplex virus thymidine kinase and adenine
phosphoribosyltransferase genes, for use in tk- or apr- cells, respectively. (See, e.g., Wigler, M. et
al. (1977) Cell 11:223-232; Lowy, I. et al. (1980) Cell 22:817-823.) Also, antimetabolite,
antibiotic, or herbicide resistance can be used as the basis for selection. For example, dhfr confers
resistance to methotrexate; neo confers resistance to the aminoglycosides neomycin and G-418;
and als or pat confer resistance to chlorsulfuron and phosphinothricin,
respectively. (See, e.g., Wigler, M. et al. (1980) Proc. Natl. Acad. Sci. 77:3567-3570; 
Colbere-Garapin, F. et al. (1981) J. Mol. Biol. 150:1-14) Additional selectable genes have been 
described, e.g., trpB and hisD, which alter cellular requirements for metabolites. (See, e.g.,
Hartman, S.C. and R.C. Mulligan (1988) Proc. Natl. Acad. Sci. 85:8047-8051.) Visible markers,
e.g., anthocyanins, green fluorescent proteins (GFP; Clontech), β-glucuronidase and its substrate
β-glucuronide, or luciferase and its substrate luciferin may be used. These markers can be used
not only to identify transformants, but also to quantify the amount of transient or stable protein 
expression attributable to a specific vector system. (See, e.g., Rhodes, C.A. (1995) Methods Mol.
Biol. 55:121-131.)

Although the presence/absence of marker gene expression suggests that the gene of 
interest is also present, the presence and expression of the gene may need to be confirmed. For 
example, if the sequence encoding CSIGP is inserted within a marker gene sequence, transformed 
cells containing sequences encoding CSIGP can be identified by the absence of marker gene
function. Alternatively, a marker gene can be placed in tandem with a sequence encoding CSIGP
under the control of a single promoter. Expression of the marker gene in response to induction or 
selection usually indicates expression of the tandem gene as well. 

In general, host cells that contain the nucleic acid sequence encoding CSIGP and that 
express CSIGP may be identified by a variety of procedures known to those of skill in the art.
These procedures include, but are not limited to, DNA-DNA or DNA-RNA hybridizations, PCR
amplification, and protein bioassay or immunoassay techniques which include membrane, 
solution, or chip based technologies for the detection and/or quantification of nucleic acid or 
protein sequences. 

Immunological methods for detecting and measuring the expression of CSIGP using either 
specific polyclonal or monoclonal antibodies are known in the art. Examples of such techniques
include enzyme-linked immunosorbent assays (ELISAs), radioimmunoassays (RIAs), and
fluorescence activated cell sorting (FACS). A two-site, monoclonal-based immunoassay utilizing 
monoclonal antibodies reactive to two non-interfering epitopes on CSIGP is preferred, but a 
competitive binding assay may be employed. These and other assays are well known in the art. 
(See, e.g., Hampton, R. et al. (1990) Serological Methods, a Laboratory Manual, APS Press, St.
Paul MN, Sect. IV; Coligan, J. E. et al. (1997) Current Protocols in Immunology, Greene Pub.
Associates and Wiley-Interscience, New York NY; and Pound, J.D. (1998) Immunochemical
Protocols, Humana Press, Totowa NJ.)

A wide variety of labels and conjugation techniques are known by those skilled in the art
and may be used in various nucleic acid and amino acid assays. Means for producing labeled
hybridization or PCR probes for detecting sequences related to polynucleotides encoding CSIGP 
include oligolabeling, nick translation, end-labeling, or PCR amplification using a labeled 
nucleotide. Alternatively, the sequences encoding CSIGP, or any fragments thereof, may be 
cloned into a vector for the production of an mRNA probe. Such vectors are known in the art, are
commercially available, and may be used to synthesize RNA probes in vitro by addition of an
appropriate RNA polymerase such as T7, T3, or SP6 and labeled nucleotides. These procedures 
may be conducted using a variety of commercially available kits, such as those provided by 
Amersham Pharmacia Biotech, Promega (Madison WI), and US Biochemical. Suitable reporter 
molecules or labels which may be used for ease of detection include radionuclides, enzymes,
fluorescent, chemiluminescent, or chromogenic agents, as well as substrates, cofactors, inhibitors,
magnetic particles, and the like. 

Host cells transformed with nucleotide sequences encoding CSIGP may be cultured under 
conditions suitable for the expression and recovery of the protein from cell culture. The protein 
produced by a transformed cell may be secreted or retained intracellularly depending on the
sequence and/or the vector used. As will be understood by those of skill in the art, expression
vectors containing polynucleotides which encode CSIGP may be designed to contain signal 
sequences which direct secretion of CSIGP through a prokaryotic or eukaryotic cell membrane. 

In addition, a host cell strain may be chosen for its ability to modulate expression of the 
inserted sequences or to process the expressed protein in the desired fashion. Such modifications
of the polypeptide include, but are not limited to, acetylation, carboxylation, glycosylation,
phosphorylation, lipidation, and acylation. Post-translational processing which cleaves a "prepro"
form of the protein may also be used to specify protein targeting, folding, and/or activity. 
Different host cells which have specific cellular machinery and characteristic mechanisms for 
post-translational activities (e.g., CHO, HeLa, MDCK, HEK293, and WI38) are available from
the American Type Culture Collection (ATCC, Bethesda MD) and may be chosen to ensure the
correct modification and processing of the foreign protein.

In another embodiment of the invention, natural, modified, or recombinant nucleic acid 
sequences encoding CSIGP may be ligated to a heterologous sequence resulting in translation of a
fusion protein in any of the aforementioned host systems. For example, a chimeric CSIGP protein
containing a heterologous moiety that can be recognized by a commercially available antibody
may facilitate the screening of peptide libraries for inhibitors of CSIGP activity. Heterologous 
protein and peptide moieties may also facilitate purification of fusion proteins using commercially 
available affinity matrices. Such moieties include, but are not limited to, glutathione S-transferase 
(GST), maltose binding protein (MBP), thioredoxin (Trx), calmodulin binding peptide (CBP),
6-His, FLAG, c-myc, and hemagglutinin (HA). GST, MBP, Trx, CBP, and 6-His enable purification
of their cognate fusion proteins on immobilized glutathione, maltose, phenylarsine oxide, 
calmodulin, and metal-chelate resins, respectively. FLAG, c-myc, and hemagglutinin (HA) enable 
immunoaffinity purification of fusion proteins using commercially available monoclonal and 
polyclonal antibodies that specifically recognize these epitope tags. A fusion protein may also be
engineered to contain a proteolytic cleavage site located between the CSIGP encoding sequence
and the heterologous protein sequence, so that CSIGP may be cleaved away from the heterologous
moiety following purification. Methods for fusion protein expression and purification are
discussed in Ausubel (1995, supra, ch. 10). A variety of commercially available kits may also be
used to facilitate expression and purification of fusion proteins. 
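The pairing of each fusion moiety with its purification matrix can be summarized as a small lookup table. This is only an illustrative sketch: the names `AFFINITY_MATRIX`, `IMMUNOAFFINITY_TAGS`, and `purification_route` are hypothetical, and the entries simply restate the pairings given in the paragraph above.

```python
# Tag -> affinity matrix, as listed in the text.
AFFINITY_MATRIX = {
    "GST": "immobilized glutathione",
    "MBP": "maltose",
    "Trx": "phenylarsine oxide",
    "CBP": "calmodulin",
    "6-His": "metal-chelate resin",
}

# Epitope tags purified with tag-specific antibodies rather than a ligand matrix.
IMMUNOAFFINITY_TAGS = {"FLAG", "c-myc", "HA"}


def purification_route(tag: str) -> str:
    """Return the purification approach the text associates with a fusion tag."""
    if tag in AFFINITY_MATRIX:
        return AFFINITY_MATRIX[tag]
    if tag in IMMUNOAFFINITY_TAGS:
        return "immunoaffinity (tag-specific antibody)"
    raise KeyError(f"no purification route listed for tag {tag!r}")
```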

In a further embodiment of the invention, synthesis of radiolabeled CSIGP may be
achieved in vitro using the TNT rabbit reticulocyte lysate or wheat germ extract systems
(Promega). These systems couple transcription and translation of protein-coding sequences
operably associated with the T7, T3, or SP6 promoters. Translation takes place in the presence of
a radiolabeled amino acid precursor, preferably 35S-methionine.

Fragments of CSIGP may be produced not only by recombinant production, but also by
direct peptide synthesis using solid-phase techniques. (See, e.g., Creighton, supra, pp. 55-60.)
Protein synthesis may be performed by manual techniques or by automation. Automated synthesis
may be achieved, for example, using the ABI 431A Peptide Synthesizer (Perkin-Elmer). Various
fragments of CSIGP may be synthesized separately and then combined to produce the full length
molecule.



THERAPEUTICS 

Chemical and structural similarity, e.g., in the context of sequences and motifs, exists 
between CSIGP and cell signaling proteins. In addition, the expression of CSIGP is closely 
associated with cell proliferation and inflammatory disorders. Therefore, in cell proliferative and
inflammatory disorders where CSIGP is an inhibitor or suppressor of cell proliferation, it is 
desirable to increase the expression of CSIGP. In cell proliferative and inflammatory disorders 
where CSIGP is an activator or enhancer and is promoting cell proliferation, it is desirable to 
decrease the expression of CSIGP. 

Therefore, in one embodiment, CSIGP or a fragment or derivative thereof may be
administered to a subject to treat or prevent a disorder associated with decreased expression or
activity of CSIGP. Examples of such disorders include, but are not limited to, a cell proliferative
disorder such as actinic keratosis, arteriosclerosis, atherosclerosis, bursitis, cirrhosis, hepatitis, 
mixed connective tissue disease (MCTD), myelofibrosis, paroxysmal nocturnal hemoglobinuria,
polycythemia vera, psoriasis, primary thrombocythemia; cancers including adenocarcinoma,
leukemia, lymphoma, melanoma, myeloma, sarcoma, teratocarcinoma, and, in particular, cancers 
of the adrenal gland, bladder, bone, bone marrow, brain, breast, cervix, gall bladder, ganglia, 
gastrointestinal tract, heart, kidney, liver, lung, muscle, ovary, pancreas, parathyroid, penis, 
prostate, salivary glands, skin, spleen, testis, thymus, thyroid, and uterus; and an inflammatory
disorder such as acquired immunodeficiency syndrome (AIDS), Addison's disease, adult
respiratory distress syndrome, allergies, ankylosing spondylitis, amyloidosis, anemia, asthma, 
atherosclerosis, autoimmune hemolytic anemia, autoimmune thyroiditis, bronchitis, cholecystitis, 
contact dermatitis, Crohn's disease, atopic dermatitis, dermatomyositis, diabetes mellitus, 
emphysema, episodic lymphopenia with lymphocytotoxins, erythroblastosis fetalis, erythema
nodosum, atrophic gastritis, glomerulonephritis, Goodpasture's syndrome, gout, Graves' disease,
Hashimoto's thyroiditis, hypereosinophilia, irritable bowel syndrome, multiple sclerosis, 
myasthenia gravis, myocardial or pericardial inflammation, osteoarthritis, osteoporosis, 
pancreatitis, polymyositis, psoriasis, Reiter's syndrome, rheumatoid arthritis, scleroderma, 
Sjögren's syndrome, systemic anaphylaxis, systemic lupus erythematosus, systemic sclerosis,
thrombocytopenic purpura, ulcerative colitis, uveitis, Werner syndrome, complications of cancer,
hemodialysis, and extracorporeal circulation, viral, bacterial, fungal, parasitic, protozoal, and 
helminthic infections, and trauma. 

In another embodiment, a vector capable of expressing CSIGP or a fragment or derivative 
thereof may be administered to a subject to treat or prevent a disorder associated with decreased
expression or activity of CSIGP including, but not limited to, those described above.

In a further embodiment, a pharmaceutical composition comprising a substantially 
purified CSIGP in conjunction with a suitable pharmaceutical carrier may be administered to a 
subject to treat or prevent a disorder associated with decreased expression or activity of CSIGP 
including, but not limited to, those provided above. 

In still another embodiment, an agonist which modulates the activity of CSIGP may be
administered to a subject to treat or prevent a disorder associated with decreased expression or
activity of CSIGP including, but not limited to, those listed above. 

In a further embodiment, an antagonist of CSIGP may be administered to a subject to treat 
or prevent a disorder associated with increased expression or activity of CSIGP. Examples of 
such disorders include, but are not limited to, those described above. In one aspect, an antibody
which specifically binds CSIGP may be used directly as an antagonist or indirectly as a targeting 
or delivery mechanism for bringing a pharmaceutical agent to cells or tissue which express 
CSIGP. 

In an additional embodiment, a vector expressing the complement of the polynucleotide 
encoding CSIGP may be administered to a subject to treat or prevent a disorder associated with
increased expression or activity of CSIGP including, but not limited to, those described above. 

In other embodiments, any of the proteins, antagonists, antibodies, agonists, 
complementary sequences, or vectors of the invention may be administered in combination with 
other appropriate therapeutic agents. Selection of the appropriate agents for use in combination 
therapy may be made by one of ordinary skill in the art, according to conventional pharmaceutical
principles. The combination of therapeutic agents may act synergistically to effect the treatment 
or prevention of the various disorders described above. Using this approach, one may be able to 
achieve therapeutic efficacy with lower dosages of each agent, thus reducing the potential for 
adverse side effects. 

An antagonist of CSIGP may be produced using methods which are generally known in
the art. In particular, purified CSIGP may be used to produce antibodies or to screen libraries of
pharmaceutical agents to identify those which specifically bind CSIGP. Antibodies to CSIGP may 
also be generated using methods that are well known in the art. Such antibodies may include, but 
are not limited to, polyclonal, monoclonal, chimeric, and single chain antibodies, Fab fragments,
and fragments produced by a Fab expression library. Neutralizing antibodies (i.e., those which
inhibit dimer formation) are especially preferred for therapeutic use. 

For the production of antibodies, various hosts including goats, rabbits, rats, mice, 
humans, and others may be immunized by injection with CSIGP or with any fragment or 
oligopeptide thereof which has immunogenic properties. Depending on the host species, various
adjuvants may be used to increase immunological response. Such adjuvants include, but are not
limited to, Freund's, mineral gels such as aluminum hydroxide, and surface active substances such 
as lysolecithin, pluronic polyols, polyanions, peptides, oil emulsions, KLH, and dinitrophenol. 
Among adjuvants used in humans, BCG (bacilli Calmette-Guerin) and Corynebacterium parvum
are especially preferable. 

It is preferred that the oligopeptides, peptides, or fragments used to induce antibodies to
CSIGP have an amino acid sequence consisting of at least about 5 amino acids, and, more
preferably, of at least about 10 amino acids. It is also preferable that these oligopeptides, peptides,
or fragments are identical to a portion of the amino acid sequence of the natural protein and 
contain the entire amino acid sequence of a small, naturally occurring molecule. Short stretches of 
CSIGP amino acids may be fused with those of another protein, such as KLH, and antibodies to
the chimeric molecule may be produced. 
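As an illustration of the length preference stated above, the following sketch enumerates every contiguous peptide of a chosen length from a protein sequence, so that each candidate is identical to a portion of the natural sequence. The function name `candidate_peptides` and the example sequence are hypothetical; the sketch does not attempt to predict immunogenicity.

```python
def candidate_peptides(protein: str, length: int = 10) -> list[str]:
    """List all contiguous peptides of `length` residues from `protein`.

    The lower bound of 5 residues follows the minimum stated in the text;
    the default of 10 follows the more preferred length.
    """
    if length < 5:
        raise ValueError("peptides of at least about 5 amino acids are preferred")
    return [protein[i:i + length] for i in range(len(protein) - length + 1)]
```

For an 11-residue sequence, a window of 10 yields two overlapping candidates.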

Monoclonal antibodies to CSIGP may be prepared using any technique which provides for 
the production of antibody molecules by continuous cell lines in culture. These include, but are 
not limited to, the hybridoma technique, the human B-cell hybridoma technique, and the
EBV-hybridoma technique. (See, e.g., Kohler, G. et al. (1975) Nature 256:495-497; Kozbor, D. et al.
(1985) J. Immunol. Methods 81:31-42; Cote, R.J. et al. (1983) Proc. Natl. Acad. Sci. 
80:2026-2030; and Cole, S.P. et al. (1984) Mol. Cell Biol. 62:109-120.) 

In addition, techniques developed for the production of "chimeric antibodies," such as the 
splicing of mouse antibody genes to human antibody genes to obtain a molecule with appropriate
antigen specificity and biological activity, can be used. (See, e.g., Morrison, S.L. et al. (1984)
Proc. Natl. Acad. Sci. 81:6851-6855; Neuberger, M.S. et al. (1984) Nature 312:604-608; and
Takeda, S. et al. (1985) Nature 314:452-454.) Alternatively, techniques described for the
production of single chain antibodies may be adapted, using methods known in the art, to produce 
CSIGP-specific single chain antibodies. Antibodies with related specificity, but of distinct
idiotypic composition, may be generated by chain shuffling from random combinatorial
immunoglobulin libraries. (See, e.g., Burton, D.R. (1991) Proc. Natl. Acad. Sci. 88:10134-10137.)

Antibodies may also be produced by inducing in vivo production in the lymphocyte 
population or by screening immunoglobulin libraries or panels of highly specific binding reagents 
as disclosed in the literature. (See, e.g., Orlandi, R. et al. (1989) Proc. Natl. Acad. Sci.
86:3833-3837; Winter, G. et al. (1991) Nature 349:293-299.)

Antibody fragments which contain specific binding sites for CSIGP may also be 
generated. For example, such fragments include, but are not limited to, F(ab')2 fragments 
produced by pepsin digestion of the antibody molecule and Fab fragments generated by reducing 
the disulfide bridges of the F(ab')2 fragments. Alternatively, Fab expression libraries may be
constructed to allow rapid and easy identification of monoclonal Fab fragments with the desired
specificity. (See, e.g., Huse, W.D. et al. (1989) Science 246:1275-1281.) 

Various immunoassays may be used for screening to identify antibodies having the 
desired specificity. Numerous protocols for competitive binding or immunoradiometric assays 
using either polyclonal or monoclonal antibodies with established specificities are well known in
the art. Such immunoassays typically involve the measurement of complex formation between
CSIGP and its specific antibody. A two-site, monoclonal-based immunoassay utilizing
monoclonal antibodies reactive to two non-interfering CSIGP epitopes is preferred, but a
competitive binding assay may also be employed (Pound, supra).

Various methods such as Scatchard analysis in conjunction with radioimmunoassay
techniques may be used to assess the affinity of antibodies for CSIGP. Affinity is expressed as an
association constant, Ka, which is defined as the molar concentration of CSIGP-antibody complex
divided by the molar concentrations of free antigen and free antibody under equilibrium
conditions. The Ka determined for a preparation of polyclonal antibodies, which are
heterogeneous in their affinities for multiple CSIGP epitopes, represents the average affinity, or
avidity, of the antibodies for CSIGP. The Ka determined for a preparation of monoclonal
antibodies, which are monospecific for a particular CSIGP epitope, represents a true measure of
affinity. High-affinity antibody preparations with Ka ranging from about 10^9 to 10^12 L/mole are
preferred for use in immunoassays in which the CSIGP-antibody complex must withstand rigorous
manipulations. Low-affinity antibody preparations with Ka ranging from about 10^6 to 10^7 L/mole
are preferred for use in immunopurification and similar procedures which ultimately require
dissociation of CSIGP, preferably in active form, from the antibody (Catty, D. (1988) Antibodies,
Volume I: A Practical Approach, IRL Press, Washington DC; Liddell, J. E. and Cryer, A. (1991)
A Practical Guide to Monoclonal Antibodies, John Wiley & Sons, New York NY).
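The definition of the association constant given above can be written out as a short calculation. This is a minimal sketch with a hypothetical function name and made-up example concentrations; it assumes a simple one-to-one binding equilibrium and is not a substitute for Scatchard analysis of real binding data.

```python
def association_constant(complex_molar: float,
                         free_antigen_molar: float,
                         free_antibody_molar: float) -> float:
    """Ka in L/mole: [complex] / ([free antigen] * [free antibody]) at equilibrium."""
    if free_antigen_molar <= 0 or free_antibody_molar <= 0:
        raise ValueError("free concentrations must be positive")
    return complex_molar / (free_antigen_molar * free_antibody_molar)


# Illustrative values only: 1 nM complex with 1 nM each of free antigen and antibody.
ka = association_constant(1e-9, 1e-9, 1e-9)
print(f"Ka = {ka:.1e} L/mole")
```

With all three concentrations at 1 nM, the quotient is on the order of 10^9 L/mole.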

The titer and avidity of polyclonal antibody preparations may be further evaluated to
determine the quality and suitability of such preparations for certain downstream applications. For
example, a polyclonal antibody preparation containing at least 1-2 mg specific antibody/ml,
preferably 5-10 mg specific antibody/ml, is preferred for use in procedures requiring precipitation
of CSIGP-antibody complexes. Procedures for evaluating antibody specificity, titer, and avidity,
and guidelines for antibody quality and usage in various applications, are generally available.
(See, e.g., Catty, supra, and Coligan et al., supra.)

In another embodiment of the invention, the polynucleotides encoding CSIGP, or any 
fragment or complement thereof, may be used for therapeutic purposes. In one aspect, the 
complement of the polynucleotide encoding CSIGP may be used in situations in which it would be 
desirable to block the transcription of the mRNA. In particular, cells may be transformed with
sequences complementary to polynucleotides encoding CSIGP. Thus, complementary molecules
or fragments may be used to modulate CSIGP activity, or to achieve regulation of gene function. 
Such technology is now well known in the art, and sense or antisense oligonucleotides or larger 
fragments can be designed from various locations along the coding or control regions of sequences 
encoding CSIGP. 

Expression vectors derived from retroviruses, adenoviruses, or herpes or vaccinia viruses,
or from various bacterial plasmids, may be used for delivery of nucleotide sequences to the
targeted organ, tissue, or cell population. Methods which are well known to those skilled in the art
can be used to construct vectors to express nucleic acid sequences complementary to the
polynucleotides encoding CSIGP. (See, e.g., Sambrook, supra; Ausubel, 1995, supra.)

Genes encoding CSIGP can be turned off by transforming a cell or tissue with expression
vectors which express high levels of a polynucleotide, or fragment thereof, encoding CSIGP.
Such constructs may be used to introduce untranslatable sense or antisense sequences into a cell. 
Even in the absence of integration into the DNA, such vectors may continue to transcribe RNA 
molecules until they are disabled by endogenous nucleases. Transient expression may last for a
month or more with a non-replicating vector, and may last even longer if appropriate replication
elements are part of the vector system. 

As mentioned above, modifications of gene expression can be obtained by designing 
complementary sequences or antisense molecules (DNA, RNA, or PNA) to the control, 5', or 
regulatory regions of the gene encoding CSIGP. Oligonucleotides derived from the transcription
initiation site, e.g., between about positions -10 and +10 from the start site, are preferred.
Similarly, inhibition can be achieved using triple helix base-pairing methodology. Triple helix
pairing is useful because it causes inhibition of the ability of the double helix to open sufficiently 
for the binding of polymerases, transcription factors, or regulatory molecules. Recent therapeutic 
advances using triplex DNA have been described in the literature. (See, e.g., Gee, J.E. et al.
(1994) in Huber, B.E. and B.I. Carr, Molecular and Immunologic Approaches, Futura Publishing,
Mt. Kisco NY, pp. 163-177.) A complementary sequence or antisense molecule may also be 
designed to block translation of mRNA by preventing the transcript from binding to ribosomes. 
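The design of a complementary oligonucleotide spanning about positions -10 to +10 of the start site can be sketched as follows. The function names and the 0-based `start_index` convention (index of the +1 nucleotide on the sense strand) are assumptions made for illustration, and the sketch handles unambiguous DNA bases only.

```python
_COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(dna: str) -> str:
    """Return the reverse complement of a DNA sequence (A, C, G, T only)."""
    return dna.upper().translate(_COMPLEMENT)[::-1]


def antisense_for_start_site(sense_strand: str, start_index: int,
                             upstream: int = 10, downstream: int = 10) -> str:
    """Antisense oligo covering positions -upstream..+downstream around the start site.

    `start_index` is the 0-based index of the +1 nucleotide. There is no
    position 0, so the window holds `upstream + downstream` nucleotides.
    """
    lo = max(0, start_index - upstream)
    hi = min(len(sense_strand), start_index + downstream)
    return reverse_complement(sense_strand[lo:hi])
```

The returned oligonucleotide is written 5' to 3' and pairs with the sense-strand window in antiparallel orientation.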

Ribozymes, enzymatic RNA molecules, may also be used to catalyze the specific cleavage 
of RNA. The mechanism of ribozyme action involves sequence-specific hybridization of the
ribozyme molecule to complementary target RNA, followed by endonucleolytic cleavage. For
example, engineered hammerhead motif ribozyme molecules may specifically and efficiently 
catalyze endonucleolytic cleavage of sequences encoding CSIGP. 

Specific ribozyme cleavage sites within any potential RNA target are initially identified by 
scanning the target molecule for ribozyme cleavage sites, including the following sequences:
GUA, GUU, and GUC. Once identified, short RNA sequences of between 15 and 20
ribonucleotides, corresponding to the region of the target gene containing the cleavage site, may
be evaluated for secondary structural features which may render the oligonucleotide inoperable. 
The suitability of candidate targets may also be evaluated by testing accessibility to hybridization 
with complementary oligonucleotides using ribonuclease protection assays. 
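The scanning step described above can be sketched in a few lines. The names `find_cleavage_sites` and `candidate_windows`, and the choice to center each window on the middle base of the triplet, are assumptions made for illustration; the sketch only locates GUA, GUU, and GUC triplets and extracts 15 to 20 nucleotide windows, and it does not evaluate secondary structure or hybridization accessibility.

```python
import re

# Triplets named in the text as ribozyme cleavage sites.
CLEAVAGE_TRIPLETS = ("GUA", "GUU", "GUC")
# Lookahead so that overlapping occurrences are all reported.
_SITE_PATTERN = re.compile("(?=(?:" + "|".join(CLEAVAGE_TRIPLETS) + "))")


def find_cleavage_sites(rna: str) -> list[int]:
    """Return 0-based start indices of every GUA, GUU, or GUC triplet."""
    seq = rna.upper().replace("T", "U")
    return [m.start() for m in _SITE_PATTERN.finditer(seq)]


def candidate_windows(rna: str, length: int = 17) -> list[tuple[int, str]]:
    """Return (site index, window) pairs for windows that fit inside the target.

    Each window is `length` nucleotides long and is placed so that the
    triplet lies near its center; sites too close to either end are skipped.
    """
    if not 15 <= length <= 20:
        raise ValueError("window length should be between 15 and 20 nucleotides")
    seq = rna.upper().replace("T", "U")
    windows = []
    for site in find_cleavage_sites(seq):
        start = site + 1 - length // 2
        end = start + length
        if start >= 0 and end <= len(seq):
            windows.append((site, seq[start:end]))
    return windows
```

Each returned window would then be a candidate for the structural and accessibility checks mentioned in the text.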

Complementary ribonucleic acid molecules and ribozymes of the invention may be
prepared by any method known in the art for the synthesis of nucleic acid molecules. These
include techniques for chemically synthesizing oligonucleotides such as solid phase
phosphoramidite chemical synthesis. Alternatively, RNA molecules may be generated by in vitro
and in vivo transcription of DNA sequences encoding CSIGP. Such DNA sequences may be
incorporated into a wide variety of vectors with suitable RNA polymerase promoters such as T7 or
SP6. Alternatively, these cDNA constructs that synthesize complementary RNA, constitutively or 
inducibly, can be introduced into cell lines, cells, or tissues. 

RNA molecules may be modified to increase intracellular stability and half-life. Possible 
modifications include, but are not limited to, the addition of flanking sequences at the 5' and/or 3'
ends of the molecule, or the use of phosphorothioate or 2' O-methyl rather than phosphodiester
linkages within the backbone of the molecule. This concept is inherent in the production of PNAs
and can be extended in all of these molecules by the inclusion of nontraditional bases such as
inosine, queuosine, and wybutosine, as well as
acetyl-, methyl-, thio-, and similarly modified forms of adenine, cytidine, guanine, thymine, and
uridine which are not as easily recognized by endogenous endonucleases.

Many methods for introducing vectors into cells or tissues are available and equally 
suitable for use in vivo, in vitro, and ex vivo. For ex vivo therapy, vectors may be introduced into
stem cells taken from the patient and clonally propagated for autologous transplant back into that 
same patient. Delivery by transfection, by liposome injections, or by polycationic amino polymers
may be achieved using methods which are well known in the art. (See, e.g., Goldman, C.K. et al.
(1997) Nature Biotechnology 15:462-466.) 

Any of the therapeutic methods described above may be applied to any subject in need of 
such therapy, including, for example, mammals such as dogs, cats, cows, horses, rabbits, 
monkeys, and most preferably, humans. 

An additional embodiment of the invention relates to the administration of a
pharmaceutical or sterile composition, in conjunction with a pharmaceutically acceptable carrier,
for any of the therapeutic effects discussed above. Such pharmaceutical compositions may consist 
of CSIGP, antibodies to CSIGP, and mimetics, agonists, antagonists, or inhibitors of CSIGP. The 
compositions may be administered alone or in combination with at least one other agent, such as a 

30 stabilizing compound, which may be administered in any sterile, biocompatible pharmaceutical 
carrier including, but not limited to, saline, buffered saline, dextrose, and water. The compositions 
may be administered to a patient alone, or in combination with other agents, drugs, or hormones. 

The pharmaceutical compositions utilized in this invention may be administered by any 
number of routes including, but not limited to, oral, intravenous, intramuscular, intra-arterial, 
intramedullary, intrathecal, intraventricular, transdermal, subcutaneous, intraperitoneal, intranasal, 
enteral, topical, sublingual, or rectal means. 

In addition to the active ingredients, these pharmaceutical compositions may contain 
suitable pharmaceutically-acceptable carriers comprising excipients and auxiliaries which 
facilitate processing of the active compounds into preparations which can be used 
pharmaceutically. Further details on techniques for formulation and administration may be found 
in the latest edition of Remington's Pharmaceutical Sciences (Mack Publishing, Easton PA). 

Pharmaceutical compositions for oral administration can be formulated using 
pharmaceutically acceptable carriers well known in the art in dosages suitable for oral 
administration. Such carriers enable the pharmaceutical compositions to be formulated as tablets, 
pills, dragees, capsules, liquids, gels, syrups, slurries, suspensions, and the like, for ingestion by 
the patient. 

Pharmaceutical preparations for oral use can be obtained through combining active 
compounds with solid excipient and processing the resultant mixture of granules (optionally, after 
grinding) to obtain tablets or dragee cores. Suitable auxiliaries can be added, if desired. Suitable 
excipients include carbohydrate or protein fillers, such as sugars, including lactose, sucrose, 
mannitol, and sorbitol; starch from corn, wheat, rice, potato, or other plants; cellulose, such as 
methyl cellulose, hydroxypropylmethyl-cellulose, or sodium carboxymethylcellulose; gums, 
including arabic and tragacanth; and proteins, such as gelatin and collagen. If desired, 
disintegrating or solubilizing agents may be added, such as the cross-linked polyvinyl pyrrolidone, 
agar, and alginic acid or a salt thereof, such as sodium alginate. 

Dragee cores may be used in conjunction with suitable coatings, such as concentrated 
sugar solutions, which may also contain gum arabic, talc, polyvinylpyrrolidone, carbopol gel, 
polyethylene glycol, and/or titanium dioxide, lacquer solutions, and suitable organic solvents or 
solvent mixtures. Dyestuffs or pigments may be added to the tablets or dragee coatings for 
product identification or to characterize the quantity of active compound, i.e., dosage. 

Pharmaceutical preparations which can be used orally include push-fit capsules made of 
gelatin, as well as soft, sealed capsules made of gelatin and a coating, such as glycerol or sorbitol. 
Push-fit capsules can contain active ingredients mixed with fillers or binders, such as lactose or 
starches, lubricants, such as talc or magnesium stearate, and, optionally, stabilizers. In soft 
capsules, the active compounds may be dissolved or suspended in suitable liquids, such as fatty 
oils, liquid, or liquid polyethylene glycol with or without stabilizers. 

Pharmaceutical formulations suitable for parenteral administration may be formulated in 
aqueous solutions, preferably in physiologically compatible buffers such as Hanks' solution, 
Ringer's solution, or physiologically buffered saline. Aqueous injection suspensions may contain 
substances which increase the viscosity of the suspension, such as sodium carboxymethyl 
cellulose, sorbitol, or dextran. Additionally, suspensions of the active compounds may be 
prepared as appropriate oily injection suspensions. Suitable lipophilic solvents or vehicles include 
fatty oils, such as sesame oil, or synthetic fatty acid esters, such as ethyl oleate, triglycerides, or 
liposomes. Non-lipid polycationic amino polymers may also be used for delivery. Optionally, the 
suspension may also contain suitable stabilizers or agents to increase the solubility of the 
compounds and allow for the preparation of highly concentrated solutions. 

For topical or nasal administration, penetrants appropriate to the particular barrier to be 
permeated are used in the formulation. Such penetrants are generally known in the art. 

The pharmaceutical compositions of the present invention may be manufactured in a 
manner that is known in the art, e.g., by means of conventional mixing, dissolving, granulating, 
dragee-making, levigating, emulsifying, encapsulating, entrapping, or lyophilizing processes. 

The pharmaceutical composition may be provided as a salt and can be formed with many 
acids, including but not limited to, hydrochloric, sulfuric, acetic, lactic, tartaric, malic, and 
succinic acid. Salts tend to be more soluble in aqueous or other protonic solvents than are the 
corresponding free base forms. In other cases, the preferred preparation may be a lyophilized 
powder which may contain any or all of the following: 1 mM to 50 mM histidine, 0.1% to 2% 
sucrose, and 2% to 7% mannitol, at a pH range of 4.5 to 5.5, that is combined with buffer prior to 
use. 

After pharmaceutical compositions have been prepared, they can be placed in an 
appropriate container and labeled for treatment of an indicated condition. For administration of 
CSIGP, such labeling would include amount, frequency, and method of administration. 

Pharmaceutical compositions suitable for use in the invention include compositions 
wherein the active ingredients are contained in an effective amount to achieve the intended 
purpose. The determination of an effective dose is well within the capability of those skilled in the 
art. 

For any compound, the therapeutically effective dose can be estimated initially either in 
cell culture assays, e.g., of neoplastic cells, or in animal models such as mice, rats, rabbits, dogs, or 
pigs. An animal model may also be used to determine the appropriate concentration range and 
route of administration. Such information can then be used to determine useful doses and routes 
for administration in humans. 

A therapeutically effective dose refers to that amount of active ingredient, for example 
CSIGP or fragments thereof, antibodies of CSIGP, and agonists, antagonists or inhibitors of 
CSIGP, which ameliorates the symptoms or condition. Therapeutic efficacy and toxicity may be 
determined by standard pharmaceutical procedures in cell cultures or with experimental animals, 
such as by calculating the ED50 (the dose therapeutically effective in 50% of the population) or 
LD50 (the dose lethal to 50% of the population) statistics. The dose ratio of toxic to therapeutic 
effects is the therapeutic index, and it can be expressed as the LD50/ED50 ratio. Pharmaceutical 
compositions which exhibit large therapeutic indices are preferred. The data obtained from cell 
culture assays and animal studies are used to formulate a range of dosage for human use. The 
dosage contained in such compositions is preferably within a range of circulating concentrations 
that includes the ED50 with little or no toxicity. The dosage varies within this range depending 
upon the dosage form employed, the sensitivity of the patient, and the route of administration. 
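
The therapeutic index described above is a simple ratio, and a minimal sketch of the arithmetic may be useful. The function name and the dose values below are hypothetical and serve only as illustration; they are not data from this disclosure.

```python
def therapeutic_index(ld50: float, ed50: float) -> float:
    """Return the LD50/ED50 ratio; both doses must be in the same units."""
    if ld50 <= 0 or ed50 <= 0:
        raise ValueError("LD50 and ED50 must be positive")
    return ld50 / ed50


# Hypothetical doses in mg/kg, chosen only to illustrate the calculation.
index = therapeutic_index(ld50=500.0, ed50=25.0)
print(index)
```

A larger value indicates a wider margin between the effective dose and the lethal dose, which is why compositions with large therapeutic indices are preferred.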

The exact dosage will be determined by the practitioner, in light of factors related to the 
subject requiring treatment. Dosage and administration are adjusted to provide sufficient levels of 
the active moiety or to maintain the desired effect. Factors which may be taken into account 
include the severity of the disease state, the general health of the subject, the age, weight, and 
gender of the subject, time and frequency of administration, drug combination(s), reaction 
sensitivities, and response to therapy. Long-acting pharmaceutical compositions may be 
administered every 3 to 4 days, every week, or biweekly depending on the half-life and clearance 
rate of the particular formulation. 

Normal dosage amounts may vary from about 0.1 to 100,000 µg, up to a total dose of 
about 1 gram, depending upon the route of administration. Guidance as to particular dosages and 
methods of delivery is provided in the literature and generally available to practitioners in the art. 
Those skilled in the art will employ different formulations for nucleotides than for proteins or their 
inhibitors. Similarly, delivery of polynucleotides or polypeptides will be specific to particular 
cells, conditions, locations, etc. 
DIAGNOSTICS 

In another embodiment, antibodies which specifically bind CSIGP may be used for the 
diagnosis of cell proliferative and inflammatory disorders characterized by expression of CSIGP, 
or in assays to monitor patients being treated with CSIGP or agonists, antagonists, or inhibitors of 
CSIGP. Antibodies useful for diagnostic purposes may be prepared in the same manner as 
described above for therapeutics. Diagnostic assays for CSIGP include methods which utilize the 
antibody and a label to detect CSIGP in human body fluids or in extracts of cells or tissues. The 
antibodies may be used with or without modification, and may be labeled by covalent or 
non-covalent attachment of a reporter molecule. A wide variety of reporter molecules, several of which 
are described above, are known in the art and may be used. 

A variety of protocols for measuring CSIGP, including ELISAs, RIAs, and FACS, are 
known in the art and provide a basis for diagnosing altered or abnormal levels of CSIGP 
expression. Normal or standard values for CSIGP expression are established by combining body 
fluids or cell extracts taken from normal mammalian subjects, preferably human, with antibody to 
CSIGP under conditions suitable for complex formation. The amount of standard complex 
formation may be quantitated by various methods, preferably by photometric means. Quantities of 
CSIGP expressed in subject, control, and disease samples from biopsied tissues are compared with 
the standard values. Deviation between standard and subject values establishes the parameters for 
diagnosing disease. 
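
The comparison of subject values with standard values can be sketched as follows. This is only an illustration under stated assumptions: the disclosure does not specify how deviation is scored, so the cutoff of two standard deviations (`k=2.0`), the function names, and the absorbance readings are all hypothetical.

```python
from statistics import mean, stdev


def standard_range(normal_values, k=2.0):
    """Return (low, high) bounds of mean +/- k standard deviations.

    The k = 2 cutoff is an assumption made for this sketch only.
    """
    centre = mean(normal_values)
    spread = stdev(normal_values)
    return centre - k * spread, centre + k * spread


def deviates_from_standard(subject_value, normal_values, k=2.0):
    """True if the subject value falls outside the standard range."""
    low, high = standard_range(normal_values, k)
    return subject_value < low or subject_value > high


# Hypothetical photometric readings (absorbance units) from normal subjects.
normal_readings = [0.40, 0.42, 0.38, 0.41, 0.39]
print(deviates_from_standard(0.95, normal_readings))
print(deviates_from_standard(0.41, normal_readings))
```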

In another embodiment of the invention, the polynucleotides encoding CSIGP may be 
used for diagnostic purposes. The polynucleotides which may be used include oligonucleotide 
sequences, complementary RNA and DNA molecules, and PNAs. The polynucleotides may be 
used to detect and quantitate gene expression in biopsied tissues in which expression of CSIGP 
may be correlated with disease. The diagnostic assay may be used to determine absence, 
presence, and excess expression of CSIGP, and to monitor regulation of CSIGP levels during 
therapeutic intervention. 

In one aspect, hybridization with PCR probes which are capable of detecting 
polynucleotide sequences, including genomic sequences, encoding CSIGP or closely related 
molecules may be used to identify nucleic acid sequences which encode CSIGP. The specificity 
of the probe, whether it is made from a highly specific region, e.g., the 5' regulatory region, or 
from a less specific region, e.g., a conserved motif, and the stringency of the hybridization or 
amplification (maximal, high, intermediate, or low), will determine whether the probe identifies 
only naturally occurring sequences encoding CSIGP, allelic variants, or related sequences. 

Probes may also be used for the detection of related sequences, and should preferably 
have at least 50% sequence identity to any of the CSIGP encoding sequences. The hybridization 
probes of the subject invention may be DNA or RNA and may be derived from the sequence of 
SEQ ID NO: 14-26 or from genomic sequences including promoters, enhancers, and introns of the 
CSIGP gene. 
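
As an illustration of the 50% identity criterion, percent identity between a probe and a target can be computed from a pairwise alignment. The sketch below assumes the two sequences are already aligned to equal length with `-` marking gaps, and it counts gap columns as non-identical; definitions of percent identity vary, and this particular one is an assumption rather than the definition used in this disclosure. The sequences shown are hypothetical.

```python
def percent_identity(aligned_a: str, aligned_b: str) -> float:
    """Percent of aligned columns in which both sequences carry the same base.

    Columns where both sequences have a gap are ignored; columns with a gap
    in only one sequence count as mismatches (an assumption of this sketch).
    """
    if len(aligned_a) != len(aligned_b):
        raise ValueError("aligned sequences must have equal length")
    compared = 0
    matches = 0
    for x, y in zip(aligned_a.upper(), aligned_b.upper()):
        if x == "-" and y == "-":
            continue
        compared += 1
        if x == y:
            matches += 1
    if compared == 0:
        raise ValueError("no comparable columns")
    return 100.0 * matches / compared


# Hypothetical probe and target, already aligned.
identity = percent_identity("ACGTACGT", "ACGTTCGA")
print(identity, identity >= 50.0)
```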

Means for producing specific hybridization probes for DNAs encoding CSIGP include the 
cloning of polynucleotide sequences encoding CSIGP or CSIGP derivatives into vectors for the 
production of mRNA probes. Such vectors are known in the art, are commercially available, and 
may be used to synthesize RNA probes in vitro by means of the addition of the appropriate RNA 
polymerases and the appropriate labeled nucleotides. Hybridization probes may be labeled by a 
variety of reporter groups, for example, by radionuclides such as 32P or 35S, or by enzymatic labels, 
such as alkaline phosphatase coupled to the probe via avidin/biotin coupling systems, and the like. 

Polynucleotide sequences encoding CSIGP may be used for the diagnosis of cell 
proliferative and inflammatory disorders associated with expression of CSIGP. Examples of such 
disorders include, but are not limited to, a disorder of cell proliferation such as actinic keratosis, 
arteriosclerosis, atherosclerosis, bursitis, cirrhosis, hepatitis, mixed connective tissue disease 
(MCTD), myelofibrosis, paroxysmal nocturnal hemoglobinuria, polycythemia vera, psoriasis, 
primary thrombocythemia; cancers including adenocarcinoma, leukemia, lymphoma, melanoma, 
myeloma, sarcoma, teratocarcinoma, and, in particular, cancers of the adrenal gland, bladder, 
bone, bone marrow, brain, breast, cervix, gall bladder, ganglia, gastrointestinal tract, heart, kidney, 
liver, lung, muscle, ovary, pancreas, parathyroid, penis, prostate, salivary glands, skin, spleen, 
testis, thymus, thyroid, and uterus; and an inflammatory disorder such as acquired 
immunodeficiency syndrome (AIDS), Addison's disease, adult respiratory distress syndrome, 
allergies, ankylosing spondylitis, amyloidosis, anemia, asthma, atherosclerosis, autoimmune 
hemolytic anemia, autoimmune thyroiditis, bronchitis, cholecystitis, contact dermatitis, Crohn's 
disease, atopic dermatitis, dermatomyositis, diabetes mellitus, emphysema, episodic lymphopenia 
with lymphocytotoxins, erythroblastosis fetalis, erythema nodosum, atrophic gastritis, 
glomerulonephritis, Goodpasture's syndrome, gout, Graves' disease, Hashimoto's thyroiditis, 
hypereosinophilia, irritable bowel syndrome, multiple sclerosis, myasthenia gravis, myocardial or 
pericardial inflammation, osteoarthritis, osteoporosis, pancreatitis, polymyositis, psoriasis, Reiter's 
syndrome, rheumatoid arthritis, scleroderma, Sjogren's syndrome, systemic anaphylaxis, systemic 
lupus erythematosus, systemic sclerosis, thrombocytopenic purpura, ulcerative colitis, uveitis, 
Werner syndrome, complications of cancer, hemodialysis, and extracorporeal circulation, viral, 
bacterial, fungal, parasitic, protozoal, and helminthic infections, and trauma. The polynucleotide 
sequences encoding CSIGP may be used in Southern or northern analysis, dot blot, or other 
membrane-based technologies; in PCR technologies; in dipstick, pin, and ELISA assays; and in 
microarrays utilizing fluids or tissues from patients to detect altered CSIGP expression. Such 
qualitative or quantitative methods are well known in the art. 

In a particular aspect, the nucleotide sequences encoding CSIGP may be useful in assays 
that detect the presence of associated disorders, particularly those mentioned above. The 
nucleotide sequences encoding CSIGP may be labeled by standard methods and added to a fluid or 
tissue sample from a patient under conditions suitable for the formation of hybridization 
complexes. After a suitable incubation period, the sample is washed and the signal is quantitated 
and compared with a standard value. If the amount of signal in the patient sample is significantly 
altered in comparison to a control sample, then the presence of altered levels of nucleotide 
sequences encoding CSIGP in the sample indicates the presence of the associated disorder. Such 
assays may also be used to evaluate the efficacy of a particular therapeutic treatment regimen in 
animal studies, in clinical trials, or to monitor the treatment of an individual patient. 

In order to provide a basis for the diagnosis of a disorder associated with expression of 
CSIGP, a normal or standard profile for expression is established. This may be accomplished by 
combining body fluids or cell extracts taken from normal subjects, either animal or human, with a 
sequence, or a fragment thereof, encoding CSIGP, under conditions suitable for hybridization or 
amplification. Standard hybridization may be quantified by comparing the values obtained from 
normal subjects with values from an experiment in which a known amount of a substantially 
purified polynucleotide is used. Standard values obtained in this manner may be compared with 
values obtained from samples from patients who are symptomatic for a disorder. Deviation from 
standard values is used to establish the presence of a disorder. 

Once the presence of a disorder is established and a treatment protocol is initiated, 
hybridization assays may be repeated on a regular basis to determine if the level of expression in 
the patient begins to approximate that which is observed in the normal subject. The results 
obtained from successive assays may be used to show the efficacy of treatment over a period 
ranging from several days to months. 

With respect to cancer, the presence of a relatively high amount of transcript in biopsied 
tissue from an individual may indicate a predisposition for the development of the disease, or may 
provide a means for detecting the disease prior to the appearance of actual clinical symptoms. A 
more definitive diagnosis of this type may allow health professionals to employ preventative 
measures or aggressive treatment earlier, thereby preventing the development or further 
progression of the cancer. 

Additional diagnostic uses for oligonucleotides designed from the sequences encoding 
CSIGP may involve the use of PCR. These oligomers may be chemically synthesized, generated 
enzymatically, or produced in vitro. Oligomers will preferably contain a fragment of a 
polynucleotide encoding CSIGP, or a fragment of a polynucleotide complementary to the 
polynucleotide encoding CSIGP, and will be employed under optimized conditions for 
identification of a specific gene or condition. Oligomers may also be employed under less 
stringent conditions for detection or quantitation of closely related DNA or RNA sequences. 

Methods which may also be used to quantitate the expression of CSIGP include 
radiolabeling or biotinylating nucleotides, coamplification of a control nucleic acid, and 
interpolating results from standard curves. (See, e.g., Melby, P.C. et al. (1993) J. Immunol. 
Methods 159:235-244; Duplaa, C. et al. (1993) Anal. Biochem. 229-236.) The speed of 
quantitation of multiple samples may be accelerated by running the assay in an ELISA format 
where the oligomer of interest is presented in various dilutions and a spectrophotometric or 
colorimetric response gives rapid quantitation. 
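
Interpolation from a standard curve can be sketched as piecewise-linear interpolation between measured standards. The sketch assumes that signal increases monotonically with amount over the range of the standards; the function name, the units, and the standard values are hypothetical and are not taken from the cited references.

```python
def interpolate_from_standard_curve(signal, standards):
    """Estimate an amount from a measured signal.

    standards: iterable of (known_amount, measured_signal) pairs.
    Uses linear interpolation between the two standards that bracket the
    signal, and refuses to extrapolate outside the curve.
    """
    points = sorted(standards, key=lambda pair: pair[1])
    for (amount_lo, signal_lo), (amount_hi, signal_hi) in zip(points, points[1:]):
        if signal_lo <= signal <= signal_hi:
            if signal_hi == signal_lo:
                return (amount_lo + amount_hi) / 2.0
            fraction = (signal - signal_lo) / (signal_hi - signal_lo)
            return amount_lo + fraction * (amount_hi - amount_lo)
    raise ValueError("signal lies outside the range of the standard curve")


# Hypothetical standards: (amount in ng, absorbance).
curve = [(0.0, 0.05), (1.0, 0.25), (2.0, 0.45), (4.0, 0.85)]
print(round(interpolate_from_standard_curve(0.35, curve), 3))
```

Refusing to extrapolate is a deliberate choice in this sketch, since readings outside the range of the standards are usually handled by diluting the sample and repeating the assay.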

In further embodiments, oligonucleotides or longer fragments derived from any of the 
polynucleotide sequences described herein may be used as targets in a microarray. The 
microarray can be used to monitor the expression level of large numbers of genes simultaneously 
and to identify genetic variants, mutations, and polymorphisms. This information may be used to 
determine gene function, to understand the genetic basis of a disorder, to diagnose a disorder, and 
to develop and monitor the activities of therapeutic agents. 

Microarrays may be prepared, used, and analyzed using methods known in the art. (See, 
e.g., Brennan, T.M. et al. (1995) U.S. Patent No. 5,474,796; Schena, M. et al. (1996) Proc. Natl. 
Acad. Sci. 93:10614-10619; Baldeschweiler et al. (1995) PCT application WO95/251116; Shalon, 
D. et al. (1995) PCT application WO95/35505; Heller, R.A. et al. (1997) Proc. Natl. Acad. Sci. 
94:2150-2155; and Heller, M.J. et al. (1997) U.S. Patent No. 5,605,662.) 

In another embodiment of the invention, nucleic acid sequences encoding CSIGP may be 
used to generate hybridization probes useful in mapping the naturally occurring genomic 
sequence. The sequences may be mapped to a particular chromosome, to a specific region of a 
chromosome, or to artificial chromosome constructions, e.g., human artificial chromosomes 
(HACs), yeast artificial chromosomes (YACs), bacterial artificial chromosomes (BACs), bacterial 
P1 constructions, or single chromosome cDNA libraries. (See, e.g., Harrington, J.J. et al. (1997) 
Nat. Genet. 15:345-355; Price, C.M. (1993) Blood Rev. 7:127-134; and Trask, B.J. (1991) Trends 
Genet. 7:149-154.) 

Fluorescent in situ hybridization (FISH) may be correlated with other physical 
chromosome mapping techniques and genetic map data. (See, e.g., Heinz-Ulrich, et al. (1995) in 
Meyers, supra, pp. 965-968.) Examples of genetic map data can be found in various scientific 
journals or at the Online Mendelian Inheritance in Man (OMIM) site. Correlation between the 
location of the gene encoding CSIGP on a physical chromosomal map and a specific disorder, or a 
predisposition to a specific disorder, may help define the region of DNA associated with that 
disorder. The nucleotide sequences of the invention may be used to detect differences in gene 
sequences among normal, carrier, and affected individuals. 

In situ hybridization of chromosomal preparations and physical mapping techniques, such 
as linkage analysis using established chromosomal markers, may be used for extending genetic 
maps. Often the placement of a gene on the chromosome of another mammalian species, such as 
mouse, may reveal associated markers even if the number or arm of a particular human 
chromosome is not known. New sequences can be assigned to chromosomal arms by physical 
mapping. This provides valuable information to investigators searching for disease genes using 
positional cloning or other gene discovery techniques. Once the disease or syndrome has been 
crudely localized by genetic linkage to a particular genomic region, e.g., ataxia-telangiectasia to 
11q22-23, any sequences mapping to that area may represent associated or regulatory genes for 
further investigation. (See, e.g., Gatti, R.A. et al. (1988) Nature 336:577-580.) The nucleotide 
sequence of the subject invention may also be used to detect differences in the chromosomal 
location due to translocation, inversion, etc., among normal, carrier, or affected individuals. 

In another embodiment of the invention, CSIGP, its catalytic or immunogenic fragments, 
or oligopeptides thereof can be used for screening libraries of compounds in any of a variety of 
drug screening techniques. The fragment employed in such screening may be free in solution, 
affixed to a solid support, borne on a cell surface, or located intracellularly. The formation of 
binding complexes between CSIGP and the agent being tested may be measured. 

Another technique for drug screening provides for high throughput screening of 
compounds having suitable binding affinity to the protein of interest. (See, e.g., Geysen, et al. 
(1984) PCT application WO84/03564.) In this method, large numbers of different small test 
compounds are synthesized on a solid substrate. The test compounds are reacted with CSIGP, or 
fragments thereof, and washed. Bound CSIGP is then detected by methods well known in the art. 
Purified CSIGP can also be coated directly onto plates for use in the aforementioned drug 
screening techniques. Alternatively, non-neutralizing antibodies can be used to capture the 
peptide and immobilize it on a solid support. 

In another embodiment, one may use competitive drug screening assays in which 
neutralizing antibodies capable of binding CSIGP specifically compete with a test compound for 
binding CSIGP. In this manner, antibodies can be used to detect the presence of any peptide 
which shares one or more antigenic determinants with CSIGP. 

In additional embodiments, the nucleotide sequences which encode CSIGP may be used in 
any molecular biology techniques that have yet to be developed, provided the new techniques rely 
on properties of nucleotide sequences that are currently known, including, but not limited to, such 
properties as the triplet genetic code and specific base pair interactions. 

Without further elaboration, it is believed that one skilled in the art can, using the 
preceding description, utilize the present invention to its fullest extent. The following preferred 
specific embodiments are, therefore, to be construed as merely illustrative, and not limitative of 
the remainder of the disclosure in any way whatsoever. 

The entire disclosure of all applications, patents, and publications, cited above and below, 
and of US provisional applications 60/085,343 (filed May 13, 1998), and 60/098,010 (filed 
August 26, 1998) are hereby incorporated by reference. 

EXAMPLES 

I. Construction of cDNA Libraries 

RNA was purchased from Clontech or isolated from tissues described in Table 4. Some 
tissues were homogenized and lysed in guanidinium isothiocyanate, while others were 
homogenized and lysed in phenol or in a suitable mixture of denaturants, such as TRIZOL (Life 
Technologies), a monophasic solution of phenol and guanidine isothiocyanate. The resulting 
lysates were centrifuged over CsCl cushions or extracted with chloroform. RNA was precipitated 
from the lysates with either isopropanol or sodium acetate and ethanol, or by other routine 
methods. 

Phenol extraction and precipitation of RNA were repeated as necessary to increase RNA 
purity. In some cases, RNA was treated with DNase. For most libraries, poly(A+) RNA was 
isolated using oligo d(T)-coupled paramagnetic particles (Promega), OLIGOTEX latex particles 
(QIAGEN, Valencia CA), or an OLIGOTEX mRNA purification kit (QIAGEN). Alternatively, 
RNA was isolated directly from tissue lysates using other RNA isolation kits, e.g., the 
POLY(A)PURE mRNA purification kit (Ambion, Austin TX). 

In some cases, Stratagene was provided with RNA and constructed the corresponding 
cDNA libraries. Otherwise, cDNA was synthesized and cDNA libraries were constructed with the 
UNIZAP vector system (Stratagene) or SUPERSCRIPT plasmid system (Life Technologies), 
using the recommended procedures or similar methods known in the art. (See, e.g., Ausubel, 
1997, supra, units 5.1-6.6.) Reverse transcription was initiated using oligo d(T) or random 
primers. Synthetic oligonucleotide adapters were ligated to double stranded cDNA, and the cDNA 
was digested with the appropriate restriction enzyme or enzymes. For most libraries, the cDNA 
was size-selected (300-1000 bp) using SEPHACRYL S1000, SEPHAROSE CL2B, or 
SEPHAROSE CL4B column chromatography (Amersham Pharmacia Biotech) or preparative 
agarose gel electrophoresis. cDNAs were ligated into compatible restriction enzyme sites of the 
polylinker of a suitable plasmid, e.g., PBLUESCRIPT plasmid (Stratagene), pSPORT1 plasmid 
(Life Technologies), or pINCY (Incyte Pharmaceuticals, Palo Alto CA). Recombinant plasmids 
were transformed into competent E. coli cells including XL1-Blue, XL1-BlueMRF, or SOLR from 
Stratagene or DH5α, DH10B, or ElectroMAX DH10B from Life Technologies. 
II. Isolation of cDNA Clones 

Plasmids were recovered from host cells by in vivo excision, using the UNIZAP vector 
system (Stratagene) or cell lysis. Plasmids were purified using at least one of the following: a 
Magic or WIZARD Minipreps DNA purification system (Promega); an AGTC Miniprep 
purification kit (Edge Biosystems, Gaithersburg MD); and QIAWELL 8 Plasmid, QIAWELL 8 
Plus Plasmid, QIAWELL 8 Ultra Plasmid purification systems or the REAL Prep 96 plasmid kit 
from QIAGEN. Following precipitation, plasmids were resuspended in 0.1 ml of distilled water 
and stored, with or without lyophilization, at 4°C. 

Alternatively, plasmid DNA was amplified from host cell lysates using direct link PCR in 
a high-throughput format (Rao, V.B. (1994) Anal. Biochem. 216:1-14). Host cell lysis and 
thermal cycling steps were carried out in a single reaction mixture. Samples were processed and 
stored in 384-well plates, and the concentration of amplified plasmid DNA was quantified 
fluorometrically using PICOGREEN dye (Molecular Probes, Eugene OR) and a Fluoroskan II 
fluorescence scanner (Labsystems Oy, Helsinki, Finland). 
III. Sequencing and Analysis 

The cDNAs were prepared for sequencing using either an ABI CATALYST 800 
(Perkin-Elmer) or a HYDRA microdispenser (Robbins) or MICROLAB 2200 (Hamilton) sequencing 
preparation system in combination with PTC-200 thermal cyclers (MJ Research). The cDNAs 
were sequenced using the ABI PRISM 373 or 377 sequencing systems or the MEGABACE 1000 
DNA sequencing system (Molecular Dynamics) and ABI protocols, base calling software, and kits 
(Perkin-Elmer). Alternatively, solutions and dyes from Amersham Pharmacia Biotech were used. 
Reading frames were determined using standard methods (Ausubel, 1997, supra). Some of the 
cDNA sequences were selected for extension using the techniques disclosed in Example V. 

The polynucleotide sequences derived from cDNA, extension, and shotgun sequencing 
were assembled and analyzed using a combination of software programs which utilize algorithms 
well known to those skilled in the art. Table 5 summarizes the software programs, descriptions, 
references, and threshold parameters used. The first column of Table 5 shows the tools, programs, 
and algorithms used, the second column provides a brief description thereof, the third column 
presents the references which are incorporated by reference herein, and the fourth column 
presents, where applicable, the scores, probability values, and other parameters used to evaluate 
the strength of a match between two sequences (the higher the probability the greater the 
homology). Sequences were analyzed using MACDNASIS PRO software (Hitachi Software 
Engineering, S. San Francisco CA) and LASERGENE software (DNASTAR). 

cDNAs were also compared to sequences in GenBank using a search algorithm developed 
by Applied Biosystems and incorporated into the INHERIT 670 sequence analysis system. In 
this algorithm, Pattern Specification Language (TRW Inc, Los Angeles, CA) was used to 
determine regions of homology. The three parameters that determine how the sequence 
comparisons run were window size, window offset, and error tolerance. Using a combination of 
these three parameters, the DNA database was searched for sequences containing regions of 
homology to the query sequence, and the appropriate sequences were scored with an initial value. 
Subsequently, these homologous regions were examined using dot matrix homology plots to 
distinguish regions of homology from chance matches. Smith-Waterman alignments were used to 
display the results of the homology search. 
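
The two techniques named above, windowed dot matrix comparison and Smith-Waterman local alignment, can be sketched generically as follows. This is not the INHERIT 670 implementation or Pattern Specification Language; the way `window`, `offset`, and `max_mismatches` map onto window size, window offset, and error tolerance is an assumption, as are the scoring values (match +2, mismatch -1, gap -2) and the example sequences.

```python
def dot_matrix(a, b, window=4, offset=1, max_mismatches=0):
    """Return (i, j) positions where windows of a and b agree.

    A dot is recorded when the windows starting at a[i] and b[j] differ in
    at most max_mismatches positions; offset is the step between windows.
    """
    dots = []
    for i in range(0, len(a) - window + 1, offset):
        for j in range(0, len(b) - window + 1, offset):
            mismatches = sum(
                1 for x, y in zip(a[i:i + window], b[j:j + window]) if x != y
            )
            if mismatches <= max_mismatches:
                dots.append((i, j))
    return dots


def smith_waterman(a, b, match=2, mismatch=-1, gap=-2):
    """Return (best_score, aligned_a, aligned_b) for the best local alignment."""
    rows, cols = len(a) + 1, len(b) + 1
    score = [[0] * cols for _ in range(rows)]
    best, best_pos = 0, (0, 0)
    # Fill the scoring matrix; scores never drop below zero (local alignment).
    for i in range(1, rows):
        for j in range(1, cols):
            step = match if a[i - 1] == b[j - 1] else mismatch
            score[i][j] = max(
                0,
                score[i - 1][j - 1] + step,
                score[i - 1][j] + gap,
                score[i][j - 1] + gap,
            )
            if score[i][j] > best:
                best, best_pos = score[i][j], (i, j)
    # Trace back from the best cell until a zero score is reached.
    i, j = best_pos
    out_a, out_b = [], []
    while i > 0 and j > 0 and score[i][j] > 0:
        step = match if a[i - 1] == b[j - 1] else mismatch
        if score[i][j] == score[i - 1][j - 1] + step:
            out_a.append(a[i - 1]); out_b.append(b[j - 1])
            i, j = i - 1, j - 1
        elif score[i][j] == score[i - 1][j] + gap:
            out_a.append(a[i - 1]); out_b.append("-")
            i -= 1
        else:
            out_a.append("-"); out_b.append(b[j - 1])
            j -= 1
    return best, "".join(reversed(out_a)), "".join(reversed(out_b))


# Hypothetical query and database sequences sharing one local region.
print(dot_matrix("ACGTAC", "ACGTAC", window=4))
print(smith_waterman("TTTACGTACGTTT", "GGGACGTACGGGG"))
```

In a dot matrix, a true region of homology shows up as a run of dots along a diagonal, whereas chance matches appear as isolated dots; the local alignment then displays the matching region base by base.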

Peptide and protein sequence homologies were ascertained using the INHERIT 670 
sequence analysis system using methods similar to those used in DNA sequence homologies. 
Pattern Specification Language and parameter windows were used to search protein databases for 
sequences containing regions of homology which were scored with an initial value. Dot-matrix 
homology plots were examined to distinguish regions of significant homology from chance 
matches. 

The polynucleotide sequences were validated by removing vector, linker, and polyA
sequences and by masking ambiguous bases, using algorithms and programs based on BLAST,
dynamic programming, and dinucleotide nearest neighbor analysis. The sequences were then
queried against a selection of public databases such as GenBank primate, rodent, mammalian,
vertebrate, and eukaryote databases, and BLOCKS to acquire annotation, using programs based
on BLAST, FASTA, and BLIMPS. The sequences were assembled into full length polynucleotide
sequences using programs based on Phred, Phrap, and Consed, and were screened for open
reading frames using programs based on GeneMark, BLAST, and FASTA. The full length
polynucleotide sequences were translated to derive the corresponding full length amino acid
sequences, and these full length sequences were subsequently analyzed by querying against
databases such as the GenBank databases (described above), SwissProt, BLOCKS, PRINTS,
PFAM, and Prosite.
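The open reading frame screen and translation step can be sketched as follows. This is a simplified illustration using the standard genetic code, not the GeneMark-based procedure referred to above; the function names are hypothetical, only the three forward frames are scanned, and an ORF that runs to the end of the sequence without a stop codon is still reported.

```python
from itertools import product

# Standard genetic code, codons enumerated in TCAG order (TTT, TTC, TTA, ...).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): a for c, a in zip(product(BASES, repeat=3), AMINO)}


def translate(dna):
    """Translate a DNA string codon by codon; unknown codons become 'X'."""
    dna = dna.upper()
    return "".join(CODON_TABLE.get(dna[i:i + 3], "X") for i in range(0, len(dna) - 2, 3))


def longest_orf(dna):
    """Return the longest Met-initiated peptide found in the three forward frames."""
    best = ""
    for frame in range(3):
        protein = translate(dna[frame:])
        for segment in protein.split("*"):  # '*' marks a stop codon
            start = segment.find("M")
            if start != -1 and len(segment) - start > len(best):
                best = segment[start:]
    return best


if __name__ == "__main__":
    print(longest_orf("CCATGTCTCTGTAAGG"))
```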

The programs described above for the assembly and analysis of full length polynucleotide
and amino acid sequences were also used to identify polynucleotide sequence fragments from
SEQ ID NO: 14-26. Fragments from about 20 to about 4000 nucleotides which are useful in
hybridization and amplification technologies were described in The Invention section above.
IV. Northern Analysis 

Northern analysis is a laboratory technique used to detect the presence of a transcript of a
gene and involves the hybridization of a labeled nucleotide sequence to a membrane on which
RNAs from a particular cell type or tissue have been bound. (See, e.g., Sambrook, supra, ch. 7;
Ausubel, 1995, supra, ch. 4 and 16.)

Analogous computer techniques applying BLAST were used to search for identical or
related molecules in nucleotide databases such as GenBank or the LIFESEQ database (Incyte
Pharmaceuticals). This analysis is much faster than multiple membrane-based hybridizations. In
addition, the sensitivity of the computer search can be modified to determine whether any
particular match is categorized as exact or similar. The basis of the search is the product score,
which is defined as:

(% sequence identity x % maximum BLAST score) / 100

The product score takes into account both the degree of similarity between two sequences
and the length of the sequence match. For example, with a product score of 40, the match will be
exact within a 1% to 2% error, and, with a product score of 70, the match will be exact. Similar
molecules are usually identified by selecting those which show product scores between 15 and 40,
although lower scores may identify related molecules.
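The product score calculation can be expressed as a short sketch. The function names are hypothetical, and treating the values 15, 40, and 70 as hard cut-offs between categories is an assumption made for illustration; the text gives them only as reference points.

```python
def product_score(percent_identity, percent_max_blast_score):
    """Product score = (% sequence identity x % maximum BLAST score) / 100."""
    return percent_identity * percent_max_blast_score / 100.0


def categorize(score):
    """Map a product score to a descriptive category (cut-offs are assumed)."""
    if score >= 70:
        return "exact"
    if score >= 40:
        return "exact within 1% to 2% error"
    if score >= 15:
        return "similar"
    return "possibly related"


if __name__ == "__main__":
    score = product_score(80, 50)
    print(score, categorize(score))
```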

The results of northern analyses are reported as a percentage distribution of libraries in which
the transcript encoding CSIGP occurred. Analysis involved the categorization of cDNA libraries
by organ/tissue and disease. The organ/tissue categories included cardiovascular, dermatologic,
developmental, endocrine, gastrointestinal, hematopoietic/immune, musculoskeletal, nervous,
reproductive, and urologic. The disease or condition categories included cancer,
inflammation/trauma, cell proliferation, neurological, and pooled. For each category, the number
of libraries expressing the sequence of interest was counted and divided by the total number of
libraries across all categories. Percentage values of tissue-specific and disease expression are
reported in Table 3.
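The percentage distribution described above can be sketched as follows. The function name is hypothetical, the input is assumed to be one category label per library in which the transcript was found, and the denominator is assumed to be the total of those libraries across all categories.

```python
from collections import Counter


def category_distribution(library_categories):
    """Return {category: percentage of expressing libraries in that category}."""
    counts = Counter(library_categories)
    total = sum(counts.values())
    if total == 0:
        return {}
    return {category: 100.0 * n / total for category, n in counts.items()}


if __name__ == "__main__":
    # Hypothetical example: four libraries expressing the sequence of interest.
    libraries = ["nervous", "nervous", "reproductive", "cardiovascular"]
    print(category_distribution(libraries))
```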

V. Extension of CSIGP Encoding Polynucleotides 

The full length nucleic acid sequence of SEQ ID NO: 14-26 was produced by extension of
an appropriate fragment of the full length molecule using oligonucleotide primers designed from
this fragment. One primer was synthesized to initiate 5' extension of the known fragment, and the
other primer, to initiate 3' extension of the known fragment. The initial primers were designed
using OLIGO 4.06 software (National Biosciences), or another appropriate program, to be about
22 to 30 nucleotides in length, to have a GC content of about 50% or more, and to anneal to the
target sequence at temperatures of about 68 °C to about 72 °C. Any stretch of nucleotides which
would result in hairpin structures and primer-primer dimerizations was avoided.
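The length, GC content, and annealing temperature criteria can be checked with a rough sketch such as the one below. It does not reproduce OLIGO 4.06; the melting temperature uses a simple, commonly quoted approximation (64.9 + 41 x (G+C - 16.4) / N) that is substituted here purely for illustration, the function names are hypothetical, and hairpin and primer-dimer screening are not covered.

```python
def gc_percent(primer):
    """Percentage of G and C bases in the primer."""
    p = primer.upper()
    return 100.0 * (p.count("G") + p.count("C")) / len(p)


def approx_tm_c(primer):
    """Approximate melting temperature in degrees C (simple GC-based estimate)."""
    p = primer.upper()
    gc = p.count("G") + p.count("C")
    return 64.9 + 41.0 * (gc - 16.4) / len(p)


def meets_criteria(primer, min_len=22, max_len=30, min_gc=50.0, tm_range=(68.0, 72.0)):
    """Check the length, GC content, and approximate Tm targets described above."""
    return (
        min_len <= len(primer) <= max_len
        and gc_percent(primer) >= min_gc
        and tm_range[0] <= approx_tm_c(primer) <= tm_range[1]
    )


if __name__ == "__main__":
    candidate = "GC" * 6 + "AT" * 6  # hypothetical 24-mer, 50% GC
    print(gc_percent(candidate), round(approx_tm_c(candidate), 1), meets_criteria(candidate))
```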

Selected human cDNA libraries were used to extend the sequence. If more than one 
extension was necessary or desired, additional or nested sets of primers were designed. 

High fidelity amplification was obtained by PCR using methods well known in the art.
PCR was performed in 96-well plates using the PTC-200 thermal cycler (MJ Research, Inc.). The
reaction mix contained DNA template, 200 nmol of each primer, reaction buffer containing Mg2+,
(NH4)2SO4, and β-mercaptoethanol, Taq DNA polymerase (Amersham Pharmacia Biotech),
ELONGASE enzyme (Life Technologies), and Pfu DNA polymerase (Stratagene), with the
following parameters for primer pair PCI A and PCI B: Step 1: 94°C, 3 min; Step 2: 94°C, 15 sec;
Step 3: 60°C, 1 min; Step 4: 68°C, 2 min; Step 5: Steps 2, 3, and 4 repeated 20 times; Step 6:
68°C, 5 min; Step 7: storage at 4°C. In the alternative, the parameters for primer pair T7 and SK+
were as follows: Step 1: 94°C, 3 min; Step 2: 94°C, 15 sec; Step 3: 57°C, 1 min; Step 4: 68°C, 2
min; Step 5: Steps 2, 3, and 4 repeated 20 times; Step 6: 68°C, 5 min; Step 7: storage at 4°C.
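The two cycling programs above can be written down as data, which makes the single difference between them (the annealing temperature) easy to see. The class and variable names are hypothetical. The sketch assumes that Steps 2 to 4 run once and are then repeated 20 more times, and it counts only programmed hold times, ignoring ramp times between temperatures.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Step:
    temp_c: float
    seconds: int


@dataclass(frozen=True)
class Program:
    initial: Step          # Step 1
    cycle: tuple           # Steps 2-4: denature, anneal, extend
    repeats: int           # Step 5: number of times Steps 2-4 are repeated
    final: Step            # Step 6
    hold_c: float = 4.0    # Step 7: storage temperature

    def passes(self):
        # Assumption: one initial pass through Steps 2-4 plus the stated repeats.
        return 1 + self.repeats

    def programmed_seconds(self):
        per_cycle = sum(step.seconds for step in self.cycle)
        return self.initial.seconds + per_cycle * self.passes() + self.final.seconds


PCI_A_PCI_B = Program(Step(94, 180), (Step(94, 15), Step(60, 60), Step(68, 120)), 20, Step(68, 300))
T7_SK = Program(Step(94, 180), (Step(94, 15), Step(57, 60), Step(68, 120)), 20, Step(68, 300))

if __name__ == "__main__":
    for name, program in (("PCI A / PCI B", PCI_A_PCI_B), ("T7 / SK+", T7_SK)):
        print(name, program.cycle[1].temp_c, program.programmed_seconds() / 60.0, "min")
```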

The concentration of DNA in each well was determined by dispensing 100 µl
PICOGREEN quantitation reagent (0.25% (v/v) PICOGREEN; Molecular Probes, Eugene OR)
dissolved in 1X TE and 0.5 µl of undiluted PCR product into each well of an opaque fluorimeter
plate (Corning Costar, Acton MA), allowing the DNA to bind to the reagent. The plate was
scanned in a Fluoroskan II (Labsystems Oy, Helsinki, Finland) to measure the fluorescence of the
sample and to quantify the concentration of DNA. A 5 µl to 10 µl aliquot of the reaction mixture
was analyzed by electrophoresis on a 1% agarose mini-gel to determine which reactions were
successful in extending the sequence.

The extended nucleotides were desalted and concentrated, transferred to 384-well plates,
digested with CviJI cholera virus endonuclease (Molecular Biology Research, Madison WI), and
sonicated or sheared prior to religation into pUC 18 vector (Amersham Pharmacia Biotech). For
shotgun sequencing, the digested nucleotides were separated on low concentration (0.6 to 0.8%)
agarose gels, fragments were excised, and agar digested with Agar ACE (Promega). Extended
clones were religated using T4 ligase (New England Biolabs, Beverly MA) into pUC 18 vector
(Amersham Pharmacia Biotech), treated with Pfu DNA polymerase (Stratagene) to fill-in
restriction site overhangs, and transfected into competent E. coli cells. Transformed cells were
selected on antibiotic-containing media, individual colonies were picked and cultured overnight at
37°C in 384-well plates in LB/2x carb liquid media.

The cells were lysed, and DNA was amplified by PCR using Taq DNA polymerase
(Amersham Pharmacia Biotech) and Pfu DNA polymerase (Stratagene) with the following
parameters: Step 1: 94°C, 3 min; Step 2: 94°C, 15 sec; Step 3: 60°C, 1 min; Step 4: 72°C, 2 min;
Step 5: steps 2, 3, and 4 repeated 29 times; Step 6: 72°C, 5 min; Step 7: storage at 4°C. DNA was
quantified by PICOGREEN reagent (Molecular Probes) as described above. Samples with low
DNA recoveries were reamplified using the same conditions as described above. Samples were
diluted with 20% dimethylsulphoxide (1:2, v/v), and sequenced using DYENAMIC energy transfer
sequencing primers and the DYENAMIC DIRECT kit (Amersham Pharmacia Biotech) or the ABI
PRISM BIGDYE Terminator cycle sequencing ready reaction kit (Perkin-Elmer).

In like manner, the nucleotide sequence of SEQ ID NO: 14-26 is used to obtain 5'
regulatory sequences using the procedure above, oligonucleotides designed for such extension,
and an appropriate genomic library.

VI. Choice, Labeling and Use of Individual Hybridization Probes

Hybridization probes derived from SEQ ID NO: 14-26 are employed to screen cDNAs,
genomic DNAs, or mRNAs. Although the labeling of oligonucleotides, consisting of about 20
base pairs, is specifically described, essentially the same procedure is used with larger nucleotide
fragments. Oligonucleotides are designed using state-of-the-art software such as OLIGO 4.06
software (National Biosciences) and labeled by combining 50 pmol of each oligomer, 250 µCi of
[γ-32P] adenosine triphosphate (Amersham Pharmacia Biotech), and T4 polynucleotide kinase
(DuPont NEN, Boston MA). The labeled oligonucleotides are substantially purified using a
SEPHADEX G-25 superfine size exclusion dextran bead column (Amersham Pharmacia Biotech).
An aliquot containing 10^7 counts per minute of the labeled probe is used in a typical
membrane-based hybridization analysis of human genomic DNA digested with one of the following
endonucleases: Ase I, Bgl II, Eco RI, Pst I, Xba I, or Pvu II (DuPont NEN).

The DNA from each digest is fractionated on a 0.7% agarose gel and transferred to nylon
membranes (Nytran Plus, Schleicher & Schuell, Durham NH). Hybridization is carried out for 16
hours at 40°C. To remove nonspecific signals, blots are sequentially washed at room temperature
under increasingly stringent conditions up to 0.1 x saline sodium citrate and 0.5% sodium dodecyl
sulfate. After XOMAT-AR film (Eastman Kodak, Rochester NY) is exposed to the blots
for several hours, hybridization patterns are compared visually.

VII. Microarrays

A chemical coupling procedure and an ink jet device can be used to synthesize array
elements on the surface of a substrate. (See, e.g., Baldeschweiler, supra.) An array analogous to a
dot or slot blot may also be used to arrange and link elements to the surface of a substrate using
thermal, UV, chemical, or mechanical bonding procedures. A typical array may be produced by
hand or using available methods and machines and contain any appropriate number of elements.
After hybridization, nonhybridized probes are removed and a scanner is used to determine the levels
and patterns of fluorescence. The degree of complementarity and the relative abundance of each
probe which hybridizes to an element on the microarray may be assessed through analysis of the
scanned images.

Full-length cDNAs, Expressed Sequence Tags (ESTs), or fragments thereof may
comprise the elements of the microarray. Fragments suitable for hybridization can be selected
using software well known in the art such as LASERGENE software (DNASTAR). Full-length
cDNAs, ESTs, or fragments thereof corresponding to one of the nucleotide sequences of the
present invention, or selected at random from a cDNA library relevant to the present invention, are
arranged on an appropriate substrate, e.g., a glass slide. The cDNA is fixed to the slide using, e.g.,
UV cross-linking followed by thermal and chemical treatments and subsequent drying. (See, e.g.,
Schena, M. et al. (1995) Science 270:467-470; Shalon, D. et al. (1996) Genome Res. 6:639-645.)
Fluorescent probes are prepared and used for hybridization to the elements on the substrate. The
substrate is analyzed by procedures described above.
VIII. Complementary Polynucleotides 

Sequences complementary to the CSIGP-encoding sequences, or any parts thereof, are
used to detect, decrease, or inhibit expression of naturally occurring CSIGP. Although use of
oligonucleotides comprising from about 15 to 30 base pairs is described, essentially the same
procedure is used with smaller or with larger sequence fragments. Appropriate oligonucleotides
are designed using OLIGO 4.06 software (National Biosciences) and the coding sequence of
CSIGP. To inhibit transcription, a complementary oligonucleotide is designed from the most
unique 5' sequence and used to prevent promoter binding to the coding sequence. To inhibit
translation, a complementary oligonucleotide is designed to prevent ribosomal binding to the
CSIGP-encoding transcript.
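Deriving a complementary oligonucleotide from a stretch of coding sequence amounts to taking its reverse complement, as in the minimal sketch below. The function names, the example sequence, and the default length of 20 bases are hypothetical; selecting the most unique 5' region, as described above, is a separate design step not shown here.

```python
# Translation table mapping each DNA base to its complement (case preserved).
COMPLEMENT = str.maketrans("ACGTacgt", "TGCAtgca")


def reverse_complement(sequence):
    """Return the reverse complement of a DNA sequence, written 5' to 3'."""
    return sequence.translate(COMPLEMENT)[::-1]


def complementary_oligo(coding_sequence, start=0, length=20):
    """Return an oligonucleotide complementary to coding_sequence[start:start+length]."""
    target = coding_sequence[start:start + length]
    return reverse_complement(target)


if __name__ == "__main__":
    # Hypothetical coding fragment; the oligo targets its first six bases.
    print(complementary_oligo("ATGTCTCTGCTGGAC", start=0, length=6))
```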
IX. Expression of CSIGP

Expression and purification of CSIGP is achieved using bacterial or virus-based
expression systems. For expression of CSIGP in bacteria, cDNA is subcloned into an appropriate
vector containing an antibiotic resistance gene and an inducible promoter that directs high levels
of cDNA transcription. Examples of such promoters include, but are not limited to, the trp-lac
(tac) hybrid promoter and the T5 or T7 bacteriophage promoter in conjunction with the lac
operator regulatory element. Recombinant vectors are transformed into suitable bacterial hosts,
e.g., BL21(DE3). Antibiotic resistant bacteria express CSIGP upon induction with isopropyl
beta-D-thiogalactopyranoside (IPTG). Expression of CSIGP in eukaryotic cells is achieved by
infecting insect or mammalian cell lines with recombinant Autographa californica nuclear
polyhedrosis virus (AcMNPV), commonly known as baculovirus. The nonessential polyhedrin
gene of baculovirus is replaced with cDNA encoding CSIGP by either homologous recombination
or bacterial-mediated transposition involving transfer plasmid intermediates. Viral infectivity is
maintained and the strong polyhedrin promoter drives high levels of cDNA transcription.
Recombinant baculovirus is used to infect Spodoptera frugiperda (Sf9) insect cells in most cases,
or human hepatocytes, in some cases. Infection of the latter requires additional genetic
modifications to baculovirus. (See Engelhard, E. K. et al. (1994) Proc. Natl. Acad. Sci. USA
91:3224-3227; Sandig, V. et al. (1996) Hum. Gene Ther. 7:1937-1945.)

In most expression systems, CSIGP is synthesized as a fusion protein with, e.g.,
glutathione S-transferase (GST) or a peptide epitope tag, such as FLAG or 6-His, permitting rapid,
single-step, affinity-based purification of recombinant fusion protein from crude cell lysates.
GST, a 26-kilodalton enzyme from Schistosoma japonicum, enables the purification of fusion
proteins on immobilized glutathione under conditions that maintain protein activity and
antigenicity (Amersham Pharmacia Biotech). Following purification, the GST moiety can be
proteolytically cleaved from CSIGP at specifically engineered sites. FLAG, an 8-amino acid
peptide, enables immunoaffinity purification using commercially available monoclonal and
polyclonal anti-FLAG antibodies (Eastman Kodak). 6-His, a stretch of six consecutive histidine
residues, enables purification on metal-chelate resins (QIAGEN). Methods for protein expression
and purification are discussed in Ausubel (1995, supra, ch. 10 and 16). Purified CSIGP obtained
by these methods can be used directly in the following activity assay.
X. Demonstration of CSIGP Activity 

CSIGP activity can be assayed in vitro by monitoring the mobilization of Ca2+ as part of
the signal transduction pathway. (See, e.g., Grynkiewicz, G. et al. (1985) J. Biol. Chem.
260:3440; McColl, S. et al. (1993) J. Immunol. 150:4550-4555; and Aussel, C. et al. (1988) supra.)
The assay requires preloading neutrophils or T cells with a fluorescent dye such as FURA-2 or
BCECF (Universal Imaging Corp, Westchester PA) whose emission characteristics have been
altered by Ca2+ binding. When the cells are exposed to one or more activating stimuli artificially
(i.e., anti-CD3 antibody ligation of the T cell receptor) or physiologically (i.e., by allogeneic
stimulation), Ca2+ flux takes place. This flux can be observed and quantified by assaying the cells
in a fluorometer or fluorescent activated cell sorter. Measurements of Ca2+ flux are compared
between cells in their normal state and those preloaded with CSIGP.

Protein kinase activity in CSIGP is determined by measuring the phosphorylation of a
protein substrate using gamma-labeled 32P-ATP and quantitation of the incorporated radioactivity
using a radioisotope counter. CSIGP is incubated with the protein substrate, 32P-ATP, and an
appropriate kinase buffer. The 32P incorporated into the product is separated from free 32P-ATP by
electrophoresis and the incorporated 32P is counted. The amount of 32P recovered is proportional
to the activity of CSIGP in the assay. A determination of the specific amino acid residue
phosphorylated is made by phosphoamino acid analysis of the hydrolyzed protein.

Protein phosphatase (PP) activity in CSIGP is determined by measuring the hydrolysis of
p-nitrophenyl phosphate (PNPP). CSIGP is incubated together with PNPP in HEPES buffer pH
7.5, in the presence of 0.1% β-mercaptoethanol at 37°C for 60 min. The reaction is stopped by the
addition of 6 ml of 10 N NaOH and the increase in light absorbance at 410 nm resulting from the
hydrolysis of PNPP is measured using a spectrophotometer. The increase in light absorbance is
proportional to the activity of CSIGP in the assay.
XI. Production of CSIGP Specific Antibodies

CSIGP substantially purified using polyacrylamide gel electrophoresis (PAGE; see, e.g.,
Harrington, M.G. (1990) Methods Enzymol. 182:488-495), or other purification techniques, is
used to immunize rabbits and to produce antibodies using standard protocols.

Alternatively, the CSIGP amino acid sequence is analyzed using LASERGENE software
(DNASTAR) to determine regions of high immunogenicity, and a corresponding oligopeptide is
synthesized and used to raise antibodies by means known to those of skill in the art. Methods for
selection of appropriate epitopes, such as those near the C-terminus or in hydrophilic regions, are
well described in the art. (See, e.g., Ausubel, 1995, supra, ch. 11.)

Typically, oligopeptides 15 residues in length are synthesized using an ABI 431A
Peptide Synthesizer (Perkin-Elmer) using fmoc-chemistry and coupled to KLH (Sigma-Aldrich,
St. Louis MO) by reaction with N-maleimidobenzoyl-N-hydroxysuccinimide ester (MBS) to
increase immunogenicity. (See, e.g., Ausubel, 1995, supra.) Rabbits are immunized with the
oligopeptide-KLH complex in complete Freund's adjuvant. Resulting antisera are tested for
antipeptide activity by, for example, binding the peptide to plastic, blocking with 1% BSA,
reacting with rabbit antisera, washing, and reacting with radio-iodinated goat anti-rabbit IgG.
XII. Purification of Naturally Occurring CSIGP Using Specific Antibodies 

Naturally occurring or recombinant CSIGP is substantially purified by immunoaffinity
chromatography using antibodies specific for CSIGP. An immunoaffinity column is constructed
by covalently coupling anti-CSIGP antibody to an activated chromatographic resin, such as
CNBr-activated SEPHAROSE (Amersham Pharmacia Biotech). After the coupling, the resin is
blocked and washed according to the manufacturer's instructions.

Media containing CSIGP are passed over the immunoaffinity column, and the column is
washed under conditions that allow the preferential absorbance of CSIGP (e.g., high ionic strength
buffers in the presence of detergent). The column is eluted under conditions that disrupt
antibody/CSIGP binding (e.g., a buffer of pH 2 to pH 3, or a high concentration of a chaotrope,
such as urea or thiocyanate ion), and CSIGP is collected.
XIII. Identification of Molecules Which Interact with CSIGP

CSIGP, or biologically active fragments thereof, are labeled with 125I Bolton-Hunter
reagent. (See, e.g., Bolton et al. (1973) Biochem. J. 133:529.) Candidate molecules previously
arrayed in the wells of a multi-well plate are incubated with the labeled CSIGP, washed, and any
wells with labeled CSIGP complex are assayed. Data obtained using different concentrations of
CSIGP are used to calculate values for the number, affinity, and association of CSIGP with the
candidate molecules.

Various modifications and variations of the described methods and systems of the
invention will be apparent to those skilled in the art without departing from the scope and spirit of
the invention. Although the invention has been described in connection with specific preferred
embodiments, it should be understood that the invention as claimed should not be unduly limited
to such specific embodiments. Indeed, various modifications of the described modes for carrying
out the invention which are obvious to those skilled in molecular biology or related fields are
intended to be within the scope of the following claims.

What is claimed is:

1. A substantially purified polypeptide comprising an amino acid sequence selected
from the group consisting of SEQ ID NO: 1-13, or a fragment thereof.

2. A substantially purified variant having at least 90% amino acid identity to the
amino acid sequence of claim 1.

3. An isolated and purified polynucleotide encoding the polypeptide of claim 1.

4. An isolated and purified polynucleotide variant having at least 70%
polynucleotide sequence identity to the polynucleotide of claim 3.

5. An isolated and purified polynucleotide which hybridizes under stringent
conditions to the polynucleotide of claim 3.

6. An isolated and purified polynucleotide having a sequence which is
complementary to the polynucleotide of claim 3.

7. A method for detecting a polynucleotide, the method comprising the steps of:
(a) hybridizing the polynucleotide of claim 6 to at least one nucleic acid in
the sample, thereby forming a hybridization complex; and
(b) detecting the hybridization complex, wherein the presence of the
hybridization complex correlates with the presence of the polynucleotide in the sample.

8. The method of claim 7 further comprising amplifying the polynucleotide prior to
hybridization.

9. An isolated and purified polynucleotide comprising a polynucleotide sequence
selected from the group consisting of SEQ ID NO: 14-26, or a fragment thereof.

10. An isolated and purified polynucleotide variant having at least 70%
polynucleotide sequence identity to the polynucleotide of claim 9.

11. An isolated and purified polynucleotide having a sequence which is
complementary to the polynucleotide of claim 9.

12. An expression vector comprising at least a fragment of the polynucleotide of
claim 3.

13. A host cell comprising the expression vector of claim 12.

14. A method for producing a polypeptide, the method comprising the steps of:
a) culturing the host cell of claim 13 under conditions suitable for the
expression of the polypeptide; and
b) recovering the polypeptide from the host cell culture.

15. A pharmaceutical composition comprising the polypeptide of claim 1 in
conjunction with a suitable pharmaceutical carrier.

16. A purified antibody which specifically binds to the polypeptide of claim 1.

17. A purified agonist of the polypeptide of claim 1.

18. A purified antagonist of the polypeptide of claim 1.

19. A method for treating or preventing a disorder associated with decreased
expression of CSIGP, the method comprising administering to a subject in need of such treatment
an effective amount of the pharmaceutical composition of claim 15.

20. A method for treating or preventing a disorder associated with increased
expression of CSIGP, the method comprising administering to a subject in need of such treatment
an effective amount of the antagonist of claim 18.

SEQUENCE LISTING



<110> INCYTE PHARMACEUTICALS, INC. 
BANDMAN, Olga 
HILLMAN, Jennifer L. 
LAL, Preeti 
YUE, Henry 
TANG, Y. Tom 
PATTERSON, Chandra 
BAUGHN, Mariah R. 
YANG, Junming 

<120> CELL SIGNALING PROTEINS 

<130> PF-0521 PCT 

<140> To Be Assigned 
<141> Herewith 

<150> 60/085,3434; 60/098,010 
<151> 1998-05-13; 1998-08-26 

<160> 26 

<170> PERL Program 

<210> 1 
<211> 418 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> misc-feature 

<223> Incyte Clone 016108 

<400> 1 



Met Ser Leu Leu Asp Cys Phe Cys Thr Ser Arg Thr Gln Val Glu
  1               5                  10                  15
Ser Leu Arg Pro Glu Lys Gln Ser Glu Thr Ser Ile His Gln Tyr
                 20                  25                  30
Leu Val Asp Glu Pro Thr Leu Ser Trp Ser Arg Pro Ser Thr Arg
                 35                  40                  45
Ala Ser Glu Val Leu Cys Ser Thr Asn Val Ser His Tyr Glu Leu
                 50                  55                  60
Gln Val Glu Ile Gly Arg Gly Phe Asp Asn Leu Thr Ser Val His
                 65                  70                  75
Leu Ala Arg His Thr Pro Thr Gly Thr Leu Val Thr Ile Lys Ile
                 80                  85                  90
Thr Asn Leu Glu Asn Cys Asn Glu Glu Arg Leu Lys Ala Leu Gln
                 95                 100                 105
Lys Ala Val Ile Leu Ser His Phe Phe Arg His Pro Asn Ile Thr
                110                 115                 120
Thr Tyr Trp Thr Val Phe Thr Val Gly Ser Trp Leu Trp Val Ile
                125                 130                 135
Ser Pro Phe Met Ala Tyr Gly Ser Ala Ser Gln Leu Leu Arg Thr
                140                 145                 150
Tyr Phe Pro Glu Gly Met Ser Glu Thr Leu Ile Arg Asn Ile Leu
                155                 160                 165
Phe Gly Ala Val Arg Gly Leu Asn Tyr Leu His Gln Asn Gly Cys
                170                 175                 180
Ile His Arg Ser Ile Lys Ala Ser His Ile Leu Ile Ser Gly Asp
                185                 190                 195
Gly Leu Val Thr Leu Ser Gly Leu Ser His Leu His Ser Leu Val
                200                 205                 210
Lys His Gly Gln Arg His Arg Ala Val Tyr Asp Phe Pro Gln Phe
                215                 220                 225
Ser Thr Ser Val Gln Pro Trp Leu Ser Pro Glu Leu Leu Arg Gln
                230                 235                 240
Asp Leu His Gly Leu Tyr Val Lys Ser Asp Ile Tyr Ser Val Gly
                245                 250                 255
Ile Thr Ala Cys Glu Leu Ala Ser Gly Gln Val Pro Phe Gln Asp
                260                 265                 270
Met His Arg Thr Gln Met Leu Leu Gln Lys Leu Lys Gly Pro Pro
                275                 280                 285
Tyr Ser Pro Leu Asp Ile Ser Ile Phe Pro Gln Ser Glu Ser Arg
                290                 295                 300
Met Lys Asn Ser Gln Ser Gly Val Asp Ser Gly Ile Gly Glu Ser
                305                 310                 315
Val Leu Val Ser Ser Gly Thr His Thr Val Asn Ser Asp Arg Leu
                320                 325                 330
His Thr Pro Ser Ser Lys Thr Phe Ser Pro Ala Phe Phe Ser Leu
                335                 340                 345
Val Gln Leu Cys Leu Gln Gln Asp Pro Glu Lys Arg Pro Ser Ala
                350                 355                 360
Ser Ser Leu Leu Ser His Val Phe Phe Lys Gln Met Lys Glu Glu
                365                 370                 375
Ser Gln Asp Ser Ile Leu Ser Leu Leu Pro Pro Ala Tyr Asn Lys
                380                 385                 390
Pro Ser Ile Ser Leu Pro Pro Val Leu Pro Trp Thr Glu Pro Glu
                395                 400                 405
Cys Asp Phe Pro Asp Glu Lys Asp Ser Tyr Trp Glu Phe
                410                 415





<210> 2 
<211> 540 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> misc- feature 

<223> Incyte Clone 640521 

<400> 2 

Met Ser Ser Cys Val Ser Ser Gln Pro Ser Ser Asn Arg Ala Ala
  1               5                  10                  15
Pro Gln Asp Glu Leu Gly Gly Arg Gly Ser Ser Ser Ser Glu Ser
                 20                  25                  30
Gln Lys Pro Cys Glu Ala Leu Arg Gly Leu Ser Ser Leu Ser Ile
                 35                  40                  45
His Leu Gly Met Glu Ser Phe Ile Val Val Thr Glu Cys Glu Pro
                 50                  55                  60
Gly Cys Ala Val Asp Leu Gly Leu Ala Arg Asp Arg Pro Leu Glu
                 65                  70                  75
Ala Asp Gly Gln Glu Val Pro Leu Asp Ser Ser Gly Ser Gln Ala
                 80                  85                  90
Arg Pro His Leu Ser Gly Arg Lys Leu Ser Leu Gln Glu Arg Ser
                 95                 100                 105
Gln Gly Gly Leu Ala Ala Gly Gly Ser Leu Asp Met Asn Gly Arg
                110                 115                 120
Cys Ile Cys Pro Ser Leu Pro Tyr Ser Pro Val Ser Ser Pro Gln
                125                 130                 135
Ser Ser Pro Arg Leu Pro Arg Arg Pro Thr Val Glu Ser His His
                140                 145                 150
Val Ser Ile Thr Gly Met Gln Asp Cys Val Gln Leu Asn Gln Tyr
                155                 160                 165
Thr Leu Lys Asp Glu Ile Gly Lys Gly Ser Tyr Gly Val Val Lys
                170                 175                 180
Leu Ala Tyr Asn Glu Asn Asp Asn Thr Tyr Tyr Ala Met Lys Val
                185                 190                 195
Leu Ser Lys Lys Lys Leu Ile Arg Gln Ala Gly Phe Pro Arg Arg
                200                 205                 210
Pro Pro Pro Arg Gly Thr Arg Pro Ala Pro Gly Gly Cys Ile Gln
                215                 220                 225
Pro Arg Gly Pro Ile Glu Gln Val Tyr Gln Glu Ile Ala Ile Leu
                230                 235                 240
Lys Lys Leu Asp His Pro Asn Val Val Lys Leu Val Glu Val Leu
                245                 250                 255
Asp Asp Pro Asn Glu Asp His Leu Tyr Met Val Phe Glu Leu Val
                260                 265                 270
Asn Gln Gly Pro Val Met Glu Val Pro Thr Leu Lys Pro Leu Ser
                275                 280                 285
Glu Asp Gln Ala Arg Phe Tyr Phe Gln Asp Leu Ile Lys Gly Ile
                290                 295                 300
Glu Tyr Leu His Tyr Gln Lys Ile Ile His Arg Asp Ile Lys Pro
                305                 310                 315
Ser Asn Leu Leu Val Gly Glu Asp Gly His Ile Lys Ile Ala Asp
                320                 325                 330
Phe Gly Val Ser Asn Glu Phe Lys Gly Ser Asp Ala Leu Leu Ser
                335                 340                 345
Asn Thr Val Gly Thr Pro Ala Phe Met Ala Pro Glu Ser Leu Ser
                350                 355                 360
Glu Thr Arg Lys Ile Phe Ser Gly Lys Ala Leu Asp Val Trp Ala
                365                 370                 375
Met Gly Val Thr Leu Tyr Cys Phe Val Phe Gly Gln Cys Pro Phe
                380                 385                 390
Met Asp Glu Arg Ile Met Cys Leu His Ser Lys Ile Lys Ser Gln
                395                 400                 405
Ala Leu Glu Phe Pro Asp Gln Pro Asp Ile Ala Glu Asp Leu Lys
                410                 415                 420
Asp Leu Ile Thr Arg Met Leu Asp Lys Asn Pro Glu Ser Arg Ile
                425                 430                 435
Val Val Pro Glu Ile Lys Leu His Pro Trp Val Thr Arg His Gly
                440                 445                 450
Ala Glu Pro Leu Pro Ser Glu Asp Glu Asn Cys Thr Leu Val Glu
                455                 460                 465
Val Thr Glu Glu Glu Val Glu Asn Ser Val Lys His Ile Pro Ser
                470                 475                 480
Leu Ala Thr Val Ile Leu Val Lys Thr Met Ile Arg Lys Arg Ser
                485                 490                 495
Phe Gly Asn Pro Phe Glu Gly Ser Arg Arg Glu Glu Arg Ser Leu
                500                 505                 510
Ser Ala Pro Gly Asn Leu Leu Thr Lys Gln Gly Ser Glu Asp Asn
                515                 520                 525
Leu Gln Gly Thr Asp Pro Pro Pro Val Gly Glu Glu Glu Val Leu
                530                 535                 540



<210> 3 
<211> 729 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> misc-feature 

<223> Incyte Clone 1250171 
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<400> 3 

Met Gln Ser Thr Ser Asn His Leu Trp Leu Leu Ser Asp Ile Leu
  1               5                  10                  15
Gly Gln Gly Ala Thr Ala Asn Val Phe Arg Gly Arg His Lys Lys
                 20                  25                  30
Thr Gly Asp Leu Phe Ala Ile Lys Val Phe Asn Asn Ile Ser Phe
                 35                  40                  45
Leu Arg Pro Val Asp Val Gln Met Arg Glu Phe Glu Val Leu Lys
                 50                  55                  60
Lys Leu Asn His Lys Asn Ile Val Lys Leu Phe Ala Ile Glu Glu
                 65                  70                  75
Glu Thr Thr Thr Arg His Lys Val Leu Ile Met Glu Phe Cys Pro
                 80                  85                  90
Cys Gly Ser Leu Tyr Thr Val Leu Glu Glu Pro Ser Asn Ala Tyr
                 95                 100                 105
Gly Leu Pro Glu Ser Glu Phe Leu Ile Val Leu Arg Asp Val Val
                110                 115                 120
Gly Gly Met Asn His Leu Arg Glu Asn Gly Ile Val His Arg Asp
                125                 130                 135
Ile Lys Pro Gly Asn Ile Met Arg Val Ile Gly Glu Asp Gly Gln
                140                 145                 150
Ser Val Tyr Lys Leu Thr Asp Phe Gly Ala Ala Arg Glu Leu Glu
                155                 160                 165
Asp Asp Glu Gln Phe Val Ser Leu Tyr Gly Thr Glu Glu Tyr Leu
                170                 175                 180
His Pro Asp Met Tyr Glu Arg Ala Val Leu Arg Lys Asp His Gln
                185                 190                 195
Lys Lys Tyr Gly Ala Thr Val Asp Leu Trp Ser Ile Gly Val Thr
                200                 205                 210
Phe Tyr His Ala Ala Thr Gly Ser Leu Pro Phe Arg Pro Phe Glu
                215                 220                 225
Gly Pro Arg Arg Asn Lys Glu Val Met Tyr Lys Ile Ile Thr Gly
                230                 235                 240
Lys Pro Ser Gly Ala Ile Ser Gly Val Gln Lys Ala Glu Asn Gly
                245                 250                 255
Pro Ile Asp Trp Ser Gly Asp Met Pro Val Ser Cys Ser Leu Ser
                260                 265                 270
Arg Gly Leu Gln Val Leu Leu Thr Pro Val Leu Ala Asn Ile Leu
                275                 280                 285
Glu Ala Asp Gln Glu Lys Cys Trp Gly Phe Asp Gln Phe Phe Ala
                290                 295                 300
Glu Thr Ser Asp Ile Leu His Arg Met Val Ile His Val Phe Ser
                305                 310                 315
Leu Gln Gln Met Thr Ala His Lys Ile Tyr Ile His Ser Tyr Asn
                320                 325                 330
Thr Ala Thr Ile Phe His Glu Leu Val Tyr Lys Gln Thr Lys Ile
                335                 340                 345
Ile Ser Ser Asn Gln Glu Leu Ile Tyr Glu Gly Arg Arg Leu Val
                350                 355                 360
Leu Glu Pro Gly Arg Leu Ala Gln His Phe Pro Lys Thr Thr Glu
                365                 370                 375
Glu Asn Pro Ile Phe Val Val Ser Arg Glu Pro Leu Asn Thr Ile
                380                 385                 390
Gly Leu Ile Tyr Glu Lys Ile Ser Leu Pro Lys Val His Pro Arg
                395                 400                 405
Tyr Asp Leu Asp Gly Asp Ala Ser Met Ala Lys Ala Ile Thr Gly
                410                 415                 420
Val Val Cys Tyr Ala Cys Arg Ile Ala Ser Thr Leu Leu Leu Tyr
                425                 430                 435
Gln Glu Leu Met Arg Lys Gly Ile Arg Trp Leu Ile Glu Leu Ile
                440                 445                 450
Lys Asp Asp Tyr Asn Glu Thr Val His Lys Lys Thr Glu Val Val
                455                 460                 465
Ile Thr Leu Asp Phe Cys Ile Arg Asn Ile Glu Lys Thr Val Lys
470 






Val 


Tyr 


Glu Lys 


Leu 


Met 


Lys 








485 






Gly Glu 


Ile Ser 


Asp 


He 


His 








500 






Ser 


Gin 


Gly Thr 


Ile 


Glu 


Thr 








515 






Leu 


Ser 


Pro Gly 


Gly Ser 


Leu 








530 






Gly Thr 


His Pro 


Lys 


Asp 


Arg 








545 






Leu 


Asn 


Cys Met 


Thr 


Glu 


He 






560 






Ala 


Glu 


Arg Arg 


Leu 


Ala 


Tyr 








575 






Asp 


Lys 


Gin Lys 


Leu 


Tyr 


Tyr 








590 






Phe 


Thr 


Asp Glu 


Cys 


Val 


Lys 








605 






Ser 


Glu 


Glu Trp 


He 


Arg 


Lys 








620 






Leu 


Ser 


Leu Thr 


Asn 


Gin 


Cys 








635 






Lys 


Tyr 


Gin Glu 


Tyr 


Thr 


Asn 








650 






Lys Met 


Phe Thr 


Ala 


Ser 


Ser 








665 






Ile 


Tyr 


Pro Ser 


Ser 


Asn 


Thr 






680 






Lys 


Lys 


Leu Lys 


Glu 


Glu 


Met 








695 






Glu 


Asn 


Asn His 


He 


Leu 


Glu 








710 






Gly Gly 


Leu Arg 


Asn 


Val 


Asp 








725 







475 


480 


He Asn Leu Glu Ala Ala 


Glu Leu 


490 


495 


Thr Lys Leu Leu Arg Leu 


Ser Ser 


505 


510 


Ser Leu Gin Asp He Asp 


Ser Arg 


520 


525 


Ala Asp Ala Trp Ala His Gin Glu 


535 


540 


Asn Val Glu Lys Leu Gin 


Val Leu 


550 


555 


Tyr Tyr Gin Phe Lys Lys Asp Lys 


565 


570 


Asn Glu Glu Gin He His 


Lys Phe 


580 


585 


His Ala Thr Lys Ala Met 


Thr His 


595 


600 


Lys Tyr Glu Ala Phe Leu 


Asn Lys 


610 


615 


Met Leu His Leu Arg Lys 


Gin Leu 


625 


630 


Phe Asp He Glu Glu Glu 


Val Ser 


640 


645 


Glu Leu Gin Glu Thr Leu 


Pro Gin 


655 


660 


Gly He Lys His Thr Met 


Thr Pro 


670 


675 


Leu Val Glu Met Thr Leu Gly Met 


685 


690 


Glu Gly Val Val Lys Glu 


Leu Ala 


700 


705 


Arg Phe Gly Ser Leu Thr Met Asp 


715 


720 


Cys Leu 





<210> 4 
<211> 313 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> misc-feature 

<223> Incyte Clone 1911587 

<400> 4 



Met 


Pro 


Gly 


Leu 


Leu 


Leu 


Cys Glu Pro Thr Glu Leu Tyr Asn 


He 


1 






5 




10 


15 


Leu 


Asn 


Gin 


Ala 


Thr 


Lys 


Leu Ser Arg Leu Thr Asp Pro Asn Tyr 










20 




25 


30 


Leu 


Cys 


Leu 


Leu 


Asp 


Val 


Arg Ser Lys Trp Glu Tyr Asp Glu 


Ser 








35 




40 


45 


His 


Val 


He 


Thr 


Ala 


Leu 


Arg Val Lys Lys Lys Asn Asn Glu 


Tyr 










50 




55 


60 


Leu 


Leu 


Pro 


Glu 


Ser 


Val 


Asp Leu Glu Cys Val Lys Tyr Cys 


Val 










65 




70 


75 


Val 


Tyr 


Asp 


Asn 


Asn 


Ser 


Ser Thr Leu Glu He Leu Leu Lys 


Asp 






80 




85 


90 


Asp 


Asp 


Asp 


Asp 


Ser 


Asp 


Ser Asp Gly Asp Gly Lys Asp Leu Val 








95 




100 


105 


Pro 


Gin 


Ala 


Ala 


He 


Glu 


Tyr Gly Arg He Leu Thr Arg Leu 


Thr 
110 



His 


His 


Pro Val 


Tyr 
125 


lie 


Leu 


Gly 


Thr 


Tyr His 


Phe 
140 


Leu 


Arg 


Gin 


Glu 


Leu Asp 


TV 1 

Ala 
155 


Phe 


Gin 


Gly 


Lys 


Val Phe 


Val 
170 


Gly 


Asn 


He 


Gin 


Lys Asp 


Leu 
185 


Lys 


He 


Asp 


Thr 


Gly Pro 


Phe 


Phe 


TV 1 

Ala 






200 






He 


Arg 


He Glu 


Asp 


Ser 


Pro 






215 






Arg 


U 1 a 


neL v»ys 












230 






Val 


He 


Leu He 


Phe 
245 


Ser 


Thr 


Ala 


He 


He Ala 


Tyr 
260 


Leu 


Met 


Arg 


Ser 


Trp Ala 


Tyr 


Val 


Lys 






275 






Asn 


Arg 


Gly Leu 


Val 
290 


Ser 


Gin 


Leu 


Gly 


Asp Ser 


He 


Thr 


Asn 



305 







115 




120 


Lys 


Gly Gly Tyr Glu Arg 


Phe 


Ser 






130 






Thr 


Gin 


Lys He He Trp Met 


Pro 






145 




150 


Pro 


Tyr 


Pro He Glu He 


Val 


Pro 




160 




165 


Phe 


Ser 


Gin Ala Cys Asp 


Pro 


Lys 






175 




180 


Lys 


Ala 


His Val Asn Val 


Ser 


Met 




190 




195 


Gly Asp Ala Asp Arg Leu 


Leu 


His 






205 




210 


Glu 


Ala 


Gin He Leu Pro 


Phe 


Leu 






220 




225 


Glu 


He 


His His His Leu Gly Ser 






235 




240 


Gin Gly 


He Ser Arg Ser 


Cys 


Ala 






250 




255 


His 


Ser 


Asn Glu Gin Thr 


Leu 


Gin 






265 




270 


Lys 


Cys 


Lys Asn Asn Met 


Cys 


Pro 






280 




285 


Leu 


Leu 


Glu Trp Glu Lys 


Thr 


He 






295 




300 


He 


Met 


Asp Pro Leu Tyr 










310 
<400> 5 












Met 


Arg 


Asp 


Pro 


Leu 


Thr 


Asp Cys Pro Tyr Asn Lys Val Tyr Lys 


1 






5 




10 15 


Asn 


Leu 


Lys 


Glu 


Phe 


Ser 


Gin Asn Gly Glu Asn Phe Cys Lys Gin 








20 




25 30 


Val 


Thr 


Ser 


Val 


Leu 
35 


Gin 


Gin Arg Ala Asn Leu Glu He Ser Tyr 
40 45 


Ala 


Lys 


Gly 


Leu 


Gin 


Lys 


Leu Ala Ser Lys Leu Ser Lys Ala Leu 






50 




55 60 


Gin 


Asn 


Thr 


Arg 


Lys 
65 


Ser 


Cys Val Ser Ser Ala Trp Ala Trp Ala 
70 75 


Ser 


Glu 


Gly 


Met 


Lys 


Ser 


Thr Ala Asp Leu His Gin Lys Leu Gly 








80 




85 90 


Lys 


Ala 


He 


Glu 


Leu 


Glu 


Ala He Lys Pro Thr Tyr Gin Val Leu 








95 




100 105 


Asn 


Val 


Gin 


Glu 


Lys 
110 


Lys 


Arg Lys Ser Leu Asp Asn Glu Val Glu 
115 120 


Lys 


Thr 


Ala 


Asn 


Leu 


Val 


He Ser Asn Trp Asn Gin Gin He Lys 








125 




130 135 


Ala 


Lys 


Lys 


Lys 


Leu 


Met 


Val Ser Thr Lys Lys His Glu Ala Leu 




140 




145 150 


Phe 


Gin 


Leu 


Val 


Glu 
155 


Ser 


Ser Lys Gin Ser Met Thr Glu Lys Glu 
160 165 


Lys 


Arg 


Lys 


Leu 


Leu 


Asn 


Lys Leu Thr Lys Ser Thr Glu Lys Leu 
170 






175 






180 


Glu 


Lys 


Glu Asp 


Glu 


Asn 


Tyr 


Tyr Gin Lys 


Asn Met Ala Gly 


Tyr 










185 






190 






195 


Ser 


Thr 


Arg Leu 


Lys 


Trp 


Glu 


Asn Thr Leu 


Glu Asn 


Cys Tyr 


Gin 










200 






205 






210 


Ser 


lie 


Leu 


Glu 


Leu 


Glu 


Lys 


Glu Arg Ile 


Gin Leu 


Leu Cys 


Asn 










215 






220 






225 


Asn 


Leu 


Asn 


Gin 


Tyr 


Ser 


Gin 


His He Ser 


Leu Phe Gly Gin 


Thr 










o o n 
z JO 






o o c 






240 


Leu 


Thr 


Thr 


Cys 


His 


Thr 


Gin 


He His Cys 


Ala He 


Ser Lys 


He 










z4D 






OCA 

zbO 






o c c 
Z33 


Asp 


lie 


Glu 


Lys 


Asp 


He 


Gin 


Ala Val Met 


Glu Glu 


Thr Ala 


He 










260 






265 






270 


Leu 


Ser 


Thr 


Glu 


Asn 


Lys 


Ser 


Glu Phe Leu 


Leu Thr Asp Tyr 


Phe 










275 






280 






285 


Glu 


Glu 


Asp 


Pro 


Asn 


Ser 


Ala 


Met Asp Lys 


Glu Arg Arg Lys 


Ser 










290 






295 






300 


Leu 


Leu 


Lys 


Pro 


Lys 


Leu 


Leu 


Arg Leu Gin 


Arg Asp 


He Glu 


Lys 










305 






310 






315 


Ala 


Ser 


Lys 


Asp 


Lys 


Glu 


Gly 


Leu Glu Arg 


Met Leu 


Lys Thr 


Tyr 










320 






325 






330 


Ser 


Ser 


Thr 


Ser 


Ser 


Phe 


Ser 


Asp Ala Lys 


Ser Gin 


Lys Asp 


Thr 










335 






340 






345 


Ala 


Ala 


Leu 


Met 


Asp 


Glu 


Asn 


Asn Leu Lys 


Leu Asp 


Leu Leu 


Glu 










350 






355 






360 


Ala 


Asn 


Ser Tyr 


Lys 


Leu 


Ser 


Ser Met Leu 


Ala Glu 


Leu Glu 


Gin 










365 






370 






375 


Arg 


Pro 


Gin 


Pro 


Ser 


His 


Pro 


Cys Ser Asn 


Ser He 


Phe Arg 


Trp 










380 






385 






390 


Arg 


Glu 


Lys 


Glu 


His 


Thr 


His 


Ser Tyr Val 


Lys He 


Ser Arg 


Pro 










395 






400 






405 


Phe 


Leu 


Met 


Lys 


Arg 


Leu 


Glu 


Asn He Val 


Ser Lys 


Ala Ser 


Ser 










410 






415 






420 


Gly Gly 


Gin 


Ser 


Asn 


Pro 


Gly 


Ser Ser Thr 


Pro Ala 


Pro Gly 


Ala 










425 






430 






435 


Ala 


Gin 


Leu 


Ser 


Ser 


Arg 


Leu 


Cys Lys Ala 


Leu Tyr 


Ser Phe 


Gin 










440 






445 






450 


Ala 


Arg 


Gin Asp 


Asp 


Glu 


Leu 


Asn Leu Glu 


Lys Gly Asp He 


Val 










455 






460 






465 


lie 


He 


His 


Glu 


Lys 


Lys 


Glu 


Glu Gly Trp 


Trp Phe 


Gly Ser 


Leu 










470 






475 






480 


Asn 


Gly 


Lys 


Lys 


Gly 


His 


Phe 


Pro Ala Ala 


Tyr Val 


Glu Glu 


Leu 










485 






490 






495 


Pro 


Ser 


Asn 


Ala 


Gly 


Asn 


Thr 


Ala Thr Lys 


Ala 














500 






505 
Met Arg Lys Val Val Leu Ile 
20 
35 






Ala 


Ala 


Leu Leu 


Ala 


Ser 


His 








50 






Gin 


Val 


Asp Val 


Ser 


Asn 


Leu 






65 






Glu 


Leu 


Lys Gin 


Arg 


Phe 


Gin 








80 






Ala 


Gly 


He Met 


Pro 


Asn 


Pro 






95 






Phe Gly 


Leu Phe 


Ser 


Arg 


Lys 








110 






Glu Gly 


Leu Leu 


Thr 


Gin 


Gly 








125 






Gin 


Glu 


Val Phe 


Glu 


Thr 


Asn 








140 






Arg 


Glu 


Leu Glu 


Pro 


Leu 


Leu 






155 






Leu 


He 


Trp Thr 


Ser 


Ser 


Arg 








170 






Leu 


Glu 


Asp Phe 


Gin 


His 


Ser 






185 






Ser 


Lys 


Tyr Ala 


Thr 


Asp 


Leu 








200 






Phe 


Asn 


Gin Gin 


Gly 


Leu 


Tyr 








215 






Ala 


Leu 


Thr Asn 


Leu 


Thr 


Tyr 








230 




Thr 


Leu 


Leu Met 


Pro 


Ala 


He 








245 






Ala 


Phe 


Thr Leu 


Thr 


Pro 


Tyr 








260 






Leu 


Phe 


His Gin 


Lys 


Pro 


Glu 








275 






Leu 


Ser 


Ala Thr 


Thr 


Gly 


Phe 








290 






Lys 


Met 


Asp Leu 


Asp 


Glu 


Asp 








305 






Leu 


Leu 


Glu Leu 


Glu 


Lys 


His 








320 






Asp Asn 


Gin Ala 


Arg 


Leu 


Ser 








335 







<210> 7 

<211> 898 
<220> 
<223> Incyte Clone 2948818 

<400> 7 

Met Arg Lys Gly Val Leu Lys 

1 5 
Tyr Lys Asp Asp Pro Glu Glu 
20 

Gly His Gly Ser Phe Gly Ala 
35 

Thr Ser Glu Val Val Ala Ile 
50 

Gln Thr His Glu Lys Trp Gln 



40 










45 


Pro Thr Ala 


Glu 


Val 


Thr 


He 


Val 












bU 


Gin Ser Val 


Phe Arg Ala 


Ser 


Lys 


70 










ID 


Arg Leu Asp 


Cys 


He Tyr 


Leu 


Asn 












rO 


Gin Leu Asn 


He 


Lys 


Ala 


Leu 


Phe 


100 








105 


Val He His 


Met 


Phe 


Ser 


Thr 


Ala 


lie 
llD 










IzO 


Asp Lys He 


Thr 


Ala 


Asp Gly 


Leu 


130 










135 


Val Phe Gly 


His 


Phe 


He 


Leu 


He 


145 










150 


Cys His Ser 


Asp Asn 


Pro 


Ser 


Gin 


160 










165 


Ser Ala Arg 


Lys 


Ser 


Asn 


Phe 


Ser 


175 










180 


Lys Gly Lys 


Glu 


Pro Tyr 


Ser 


Ser 


190 










195 


Leu Ser Val 


Ala 


Leu 


Asn 


Arg 


Asn 


205 










210 


Ser Asn Val 


Ala 


Cys 


Pro 


Gly 


Thr 


220 










225 


Gly He Leu 


Pro 


Pro 


Phe 


He 


Trp 


235 










240 


Leu Leu Leu 


Arg 


Phe 


Phe 


Ala 


Asn 


250 








255 


Asn Gly Thr 


Glu 


Ala 


Leu 


Val 


Trp 


265 










270 


Ser Leu Asn 


Pro 


Leu 


He 


Lys 


Tyr 


280 










285 


Gly Arg Asn 


Tyr 


He 


Met 


Thr 


Gin 


295 










300 


Thr Ala Glu 


Lys 


Phe 


Tyr 


Gin 


Lys 


310 










315 


He Arg Val 


Thr 


He 


Gin 


Lys 


Thr 


325 










330 


Gly Ser Cys 


Leu 










340 












Asp Pro Glu 


He Ala Asp Leu 


Ser 


10 










15 


Leu Phe He 


Gly Leu His 


Glu 


He 


25 










30 


Val Tyr Phe 


Ala 


Thr 


Asn 


Ala 


His 


40 










45 


Lys Lys Met 


Ser Tyr 


Ser Gly 


Lys 


55 










60 


Asp He Leu 


Lys 


Glu 


Val 


Lys 


Phe 
65 






Leu Arg 


Gin 


Leu 


Lys 


His 


Pro 








80 






Tyr Leu 


Lys 


Glu 


His 


Thr 


Ala 








95 






Gly Ser 


Ala 


Ser Asp 


Leu 


Leu 








110 






Glu Val 


Glu 


He 


Ala 


TV T _ 

Ala 


lie 








125 






Ala Tyr 


Leu 


His 


Ser 


His 


Ala 






140 






Gly Asn 


He 


Leu 


Leu 


Thr 


Glu 








155 






Phe Gly 


Ser 


Ala 


Ser 


Met 


Ala 








170 






Thr Pro 


Tyr 


Trp Met 


Ala 


Pro 








185 






Gly Gin 


Tyr 


Asp Gly 


Lys 


Val 








200 






Cys Ile 


Glu 


Leu 


Ala 


Glu 


Arg 








215 






Ala Met 


Ser 


Ala 


Leu 


Tyr 


His 








230 






Leu Gin 


Ser 


Asn 


Glu 


Trp 


Thr 








245 






Tyr Cys 


Leu 


Gin 


Lys 


He 


Pro 








260 






Leu Leu 


Arg 


His Asp 


Phe 


Val 








275 






Leu He 


Asp 


Leu 


He 


Gin 


Arg 








290 






Asp Asn 


Leu 


Gin Tyr 


Arg 


Lys 








305 






Thr Arg 


Asn 


Gly Pro 


Leu 


Asn 








320 






Ser Glu 


His 


Gly Thr 


Ser 


Leu 








335 






Ser Asn 


His 


Ser 


He 


Pro 


Ser 








350 






Ser Ser 


Ser 


Val 


Asn 


Ser 


Met 








365 






Ser Glu 


Leu 


Val 


Met 


Met 


His 








380 






Ser Ser 


Ser 


Val 


Val 


His 


Lys 








395 






Glu Ala 


Gly 


His 


Gly 


Asp 


Pro 








410 






Ser Val 


Gin 


Ser 


Gin 


Ala 


Leu 








425 






Ala Thr 


He 


Lys 


Ser 


Ala 


Ser 






440 






Glu 


Gin 


Glu 


Asn 


Glu 


Leu 








455 






Arg Met 


Arg 


Arg 


Gin 


His 


Gin 








470 






Lys Leu 


Lys 


Ala 


Glu 


Met 


Asp 








485 






Glu Val 


Glu 


Thr 


His 


Ala 


Asn 








500 






Leu Ala 


Lys 


Lys 


Gin 


Val 


Ala 








515 






Ala Ala 


Ala 


Asp Glu 


Lys 


Lys 








530 






Gin Lys 


Lys 


Asp 


Leu 


Thr 


Thr 



70 75 



Asn 


inr 


He 


Glu 


Tyr 


iiys 


Gly 


Cys 






85 












Trp 


Leu 


Val 


Met 


Glu 


Tyr 


Cys 


Leu 






100 










105 


Glu 


Val 


His 


Lys 


Lys 


Pro 


Leu 


Gin 






115 










120 


Thr 


nxS 


Gly Ala 


Leu 


nlS 


Gly 


T All 

Leu 






130 










135 


Leu 


lie 


His 


Arg 


Asp 


lie 


Lys 


Ala 






145 










150 


Pro 


Gly 


Gin 


Val 


Lys 


Leu 


Ala 


Asp 






160 










165 


Ser 


Pro 


Ala 


Asn 


O A V 

Ser 


Phe 


Val 


Gly 






175 










180 


Glu 


Val 


He 


Leu 


Ala 


Met 


Asp 


Glu 






190 








195 


Asp 


Tin 

i±e 


Trp 


Ser 


Leu 


Gly 


lie 


Thr 






205 










210 


Lys 


Pro 


Pro 


Leu 


Phe 


Asn 


Met 


Asn 






220 










225 


He 


Ala 


Gin 


Asn 


Asp 


Ser 


Pro 


Thr 






235 










240 


Asp 


Ser 


Phe 


Arg 


Arg 


Phe 


Val 


Asp 






250 










255 


Gin 


Glu 


Arg 


Pro 


Thr 


Ser 


Ala 


Glu 






265 










270 


Arg 


Arg 


Asp Arg 


Pro 


Leu 


Arg 


Val 






280 










285 


Thr 


Lys 


Asp Ala 


Val 


Arg 


Glu 


Leu 






295 










300 


Met 


Lys 


Lys 


He 


Leu 


Phe 


Gin 


Glu 






310 










315 


Glu 


Ser 


Gin 


Glu 


Asp 


Glu 


Glu 


Asp 






325 








330 


Asn 


Arg 


Glu 


Met 


Asp 


Ser 


Leu 


Gly 






340 










345 


Met 


Ser 


Val 


Ser 


Thr 


Gly 


Ser 


Gin 






355 








360 


Gin 


Glu 


Val 


Met 


Asp 


Glu 


Ser 


Ser 






370 








375 


Asp 


Asp 


Glu 


Ser 


Thr 


T 1 « 

lie 


Asn 


Ser 






385 










390 


Lys 


Asp 


His 


Val 


Phe 


He 


Arg 


Asp 






400 










405 


Arg 


Pro 


Glu 


Pro 


Arg 


Pro 


Thr 


Gin 






415 










420 


His 


Tyr 


Arg Asn 


Arg 


Glu 


Arg 


Phe 






430 










435 


Leu 


Val 


Thr Arg 


Gin 


lie 


nXS 


Glu 






445 










>l C A 


Arg 


Glu 


Gin 


Met 


Ser 


Gly 


Tyr 


Lys 






460 










465 


Lys 


Gin 


Leu 


He 


Ala 


Leu 


Glu 


Asn 






475 










480 


Glu 


His 


Arg Leu 


Lys 


Leu 


Gin 


Lys 






490 










495 


Asn 


Ser 


Ser 


He 


Glu 


Leu 


Glu 


Lys 






505 










510 


He 


He 


Glu 


Lys 


Glu 


Ala 


Lys 


Val 






520 










525 


Phe 


Gin 


Gin 


Gin 


He 


Leu 


Ala 


Gin 






535 










540 


Phe 


Leu 


Glu 


Ser 


Gin 


Lys 


Lys 


Gin 
545 








550 






555 


Tyr Lys 


He Cys 


Lys 


Glu 


Lys 


He Lys Glu Glu Met Asn Glu Asp 










560 








565 






570 


His 


Ser 


Thr 


Pro 


Lys 


Lys 


Glu Lys Gin Glu Arg He Ser Lys His 










575 








580 






585 


Lys 


Glu 


Asn 


Leu 


Gin 


His 


Thr 


Gin Ala 


Glu Glu 


Glu Ala 


His Leu 








590 








595 






600 


Leu 


Thr 


Gin 


Gin 


Arg 


Leu 


Tyr 


Tyr Asp Lys Asn 


Cys 


Arg 


Phe Phe 










605 








610 






615 


Lys 


Arg 


Lys 


He 


Met 


He 


Lys 


Arg His 


Glu Val 


Glu 


Gin 


Gin Asn 










620 








625 






1 A 


He 


Arg 


Glu 


Glu 


Leu 


Asn 


Lys 


Lys Arg 


Thr Gin 


Lys 


Glu 


Met Glu 








635 








640 






645 


His 


Ala 


Met 


Leu 


He 


Arg 


His 


Asp Glu 


Ser Thr 


Arg 


Glu 


Leu Glu 










650 








655 






660 


Tyr 


Arg 


Gin 


Leu 


His 


Thr 


Leu 


Gin Lys 


Leu Arg 


Met 


Asp 


Leu He 








665 








670 






675 


Arg 


Leu 


Gin 


His 


Gin 


Thr 


Glu 


Leu Glu 


Asn Gin 


Leu 


Glu 


Tyr Asn 








680 








685 






690 


Lys 


Arg 


Arg 


Glu 


Arg 


Glu 


Leu 


His Arg 


Lys His 


Val 


Met 


Glu Leu 






695 








700 






705 


Arg 


Gin 


Gin 


Pro 


Lys 


Asn 


Leu 


Lys Ala 


Met Glu 


Met 


Gin 


He Lys 








710 








715 






720 


Lys 


Gin 


Phe 


Gin 


Asp Thr 


Cys 


Lys Val 


Gin Thr 


Lys 


Gin 


Tyr Lys 










725 








730 






735 


Ala 


Leu 


Lys Asn 


His 


Gin 


Leu Glu Val Thr Pro Lys Asn Glu His 










740 








745 






750 


Lys 


Thr 


He 


Leu 


Lys 


Thr 


Leu Lys Asp Glu Gin Thr Arg Lys Leu 








755 








760 






765 


Ala 


He 


Leu 


Ala 


Glu 


Gin 


Tyr Glu Gin Ser He 


Asn 


Glu 


Met Met 










770 








775 






780 


Ala 


Ser 


Gin 


Ala 


Leu Arg 


Leu Asp Glu Ala Gin Glu Ala Glu Cys 










785 








790 






795 


Gin 


Ala 


Leu Arg 


Leu 


Gin 


Leu 


Gin Gin 


Glu Met 


Glu 


Leu 


Leu Asn 










800 








805 






810 


Ala Tyr 


Gin 


Ser 


Lys 


He 


Lys 


Met Gin 


Thr Glu 


Ala 


Gin 


His Glu 










815 








820 






825 


Arg Glu 


Leu 


Gin 


Lys 


Leu 


Glu Gin Arg Val Ser Leu Arg Arg Ala 










830 








835 






840 


His 


Leu 


Glu 


Gin 


Lys 


He 


Glu 


Glu Glu 


Leu Ala 


Ala 


Leu 


Gin Lys 










845 








850 






855 


Glu Arg 


Ser 


Glu 


Arg 


He 


Lys 


Asn Leu Leu Glu Arg Gin Glu Arg 










860 








865 






870 


Glu 


He 


Glu 


Thr 


Phe Asp 


Met 


Glu Ser 


Leu Arg Met 


Gly Phe Gly 










875 








880 






885 


Asn 


Leu 


Val 


Thr 


Leu 


Asp 


Phe 


Pro Lys 


Glu Asp Tyr Arg 












890 








895 









<210> 8 

<211> 336 

<212> PRT 

<213> Homo sapiens 

<220> 
<400> 8 

Met Ala Thr Leu Ser Val Ile Gly Ser Ser Ser Leu Ile Ala Tyr 
1 5 10 15 

Ala Val Phe His Asn Ile Gln Lys Ser Pro Glu Ile Arg Pro Leu 
Phe Tyr 
Thr Glu 
Ile Cys 
Ser Phe 
Leu Arg 
Val Ile 
Ser Ser 
Gly Asn 
Cys Ile 
Pro Ser 
Ile Ala 
Met Val 
Lys Ser 
Ile Val 
Cys Trp 
Pro Gln 
Leu Thr 
Trp Thr 
Asp Ala 
Tyr Ser 
Ala Ser 



20 

Leu Ser Phe 
35 

Thr Leu Leu 
50 

Tyr Asn Leu 
65 

Leu Tyr Thr 
80 

Met Lys His 
95 

Asp Tyr Thr 
110 

Leu Ile Pro 
125 

Thr Ser Glu 
140 

Leu Met His 
155 

Ala Asn Thr 
170 

Ile Phe Leu 
185 

Leu Leu Ile 
200 

Thr Gly Phe 
215 

Asp Gln Arg 
230 

Gly Pro Ala 
245 

Asp Thr Lys 
260 

Ala Thr Ser 
275 

Gln His Lys 
290 

Asp Thr Gln 
305 

Arg Gly Leu 
320 

Thr Ser Thr 
335 



Cys Asp 
Tyr Gly 
Gln Ala 
Val Asn 
Thr Gln 
Cys Arg 
Leu Leu 
Cys Phe 
Ser Pro 
Ser Val 
Gly Ser 
Arg Ala 
Leu Gly 
Val Arg 
Val Ile 
Leu His 
Gln Gly 
Phe His 
Thr Pro 
Asn Ser 
Ile 



Leu Leu 
Ala Ser 
Val Gly 
Tyr Ile 
Ser Gly 
Val Gly 
Leu Met 
Gln Asn 
Pro Ser 
Cys Ser 
Phe Val 
Gln Thr 
Ser Glu 
Phe Tyr 
Leu Met 
Met Ala 
Leu Leu 
Gln Leu 
Leu Leu 
Leu Glu 



25 

Leu Gly Leu 
40 

Val Ala Asn 

55 

Gln Ile Phe 
70 

Trp Tyr Leu 
85 

Gln Ser Thr 
100 

Gln Met Ala 
115 

Thr Pro Val 
130 

Phe Ser Gln 
145 

Ala Met Ala 
160 

Thr Leu Tyr 
175 

Leu Ser Leu 
190 

Leu Tyr Lys 
205 

Gln Trp Ala 
220 

Pro Val Ala 
235 

Ile Ile Lys 
250 

Leu Tyr Val 
265 

Asn Cys Gly 
280 

Lys Gln Glu 
295 

Cys Ser Gln 
310 

Ser Thr Leu 

325 



30 

Cys Trp Leu 
45 

Lys Asp Ile 
60 

Tyr Ile Ser 
75 

Tyr Thr Glu 
90 

Ser Pro Leu 

105 

Phe Val Phe 
120 

Phe Cys Leu 
135 

Ser His Lys 
150 

Glu Leu Pro 
165 

Phe Tyr Gly 
180 

Leu Thr Ile 
195 

Lys Phe Val 
210 

Val Ile His 

225 

Phe Phe Cys 
240 

Leu Thr Lys 
255 

Leu Gln Ala 

270 

Val Tyr Gly 

285 

Ala Arg Arg 

300 

Lys Arg Phe 

315 

Thr Phe Pro 
330 



<210> 9 

<211> 686 

<212> PRT 

<213> Homo sapiens 
<220> 

<221> misc-feature 

<223> Incyte Clone 1403604 

<400> 9 

Met Gly Pro Arg Ser Arg Glu Arg Arg Ala Gly Ala Val Gln Asn 
1 5 10 15 

Thr Asn Asp Ser Ser Ala Leu Ser Lys Arg Ser Leu Ala Ala Arg 
20 25 30 

Gly Tyr Val Gln Asp Pro Phe Ala Ala Leu Leu Val Pro Gly Ala 
35 40 45 

Ala Arg Arg Ala Pro Leu Ile His Arg Gly Tyr Tyr Val Arg Ala 
Arg Ala Val 
Ala Pro Gln 
Gly Phe Asp 
Ala Arg Ala 
Arg Lys Ala 
Thr Gly Pro 
Glu Ser Ala 
Gln Arg Val 
Ser Pro Thr 
Pro Glu Ser 
Pro Asn Ala 
Ala Phe Gly 
Pro Leu His 
Arg Arg Phe 
Ile Asn Glu 
Arg Val Glu 
Leu Lys Cys 
Thr Leu Ser 
Arg Val Asn 
Val Ser Ser 
Ala Ser Val 
Phe Gly Glu 
Leu Leu Ser 
Gly Ser Cys 
Thr Met Thr 
Arg Leu Ser 
Phe Phe Lys 
Ile Thr Lys 
Arg His Ser 
Phe Val Tyr 
Trp His Phe 
Val Glu Gly 





50 




Arg 


His 
65 


Cys 


Ala 


Ala 

80 


Leu 


Ser 


Leu 
95 


Tyr 


Ala 


Val 
110 


Trp 


Glu 


Arg 
125 


He 


Phe 


Glu 
140 


Arg 


Asp 


Tyr 
155 


Cys 


Glu 


Glu 
170 


Ala 


Leu 


Leu 
185 


Leu 


Ala 


Ala 
200 


Ala 


Leu 


Phe 
215 


Val 


Gin 


Phe 
230 


Met 


Gly 


Leu 
245 


Glu 


Leu 


Gin 
260 


Ala 


Phe 


Tyr 
275 


His 


Asn 


He 
290 


Glu 


Ala 


His 
305 


Tyr 


His 


Thr 
320 


Leu 


Pro 


Ala 
335 


Ser 


Glu 


Gly 
350 


Gin 


Phe 


Leu 
365 


Ser 


Gin 


Glu 
380 


Gly 


Arg 


Asp 
395 


Cys 


Gly 


Thr 
410 


Gly 


Arg 


Leu 
425 


Ser 


Pro 


Val 
440 


Ser 


Ser 


Glu 
455 


Asp 


Ala 


Gly 
470 


Arg 


Thr 


Thr 
485 


Glu 


Gly 


Gly 
500 


Arg 


Leu 


His 
515 


Val 


Glu 


Val 


Pro 



Val Arg Ala 
Arg Ala Gln 
Phe Arg Leu 
Glu Val Asp 
Gly Glu Thr 
Gly Glu Pro 
Ile Leu Gly 
Leu Gly Ala 
Ala Glu Ala 
Leu Ile Ala 
Val Tyr Glu 
Leu Gln His 
Arg Phe Pro 
Gly Trp Thr 
Cys Phe Leu 
Pro Phe Asp 
Phe Ile Leu 
Val Phe Pro 
Pro Ser Gly 
Val Pro Asn 
Pro Asp Val 
Arg His Cys 
Asp Ser Glu 
Val Gln Trp 
Glu Ser Arg 
Pro Ala Leu 
Asn Asn Thr 
Lys Asp Asp 
Val Ser Cys 
Ser Val Val 
Gly Thr Met 
Glu Ala Arg 



55 






Phe 


Leu 


Glu 


70 






He 


Leu 


Ser 


85 






Lys 


Thr 


Ala 


100 






Phe 


Pro 


Asp 


115 






Pro 


Glu 


Leu 


130 






Ala 


Ser 


Ala 


145 






Leu 


Asp 


Leu 


160 






Ala 


Gly 


Leu 


175 






Val 


Leu 


Thr 


190 






Trp 


Ala 


Ala 


205 






Gin 


Met 


Arg 


220 






Phe 


Arg 


Gin 


235 






Asp 


Val 


Glu 


250 






Ala 


Cys 


Gly 


265 






Pro 


Ala 


Glu 


280 






Glu 


Phe 


Glu 


295 






Ala 


Ala 


Ser 


310 






Ser 


Ser 


Glu 


325 






Val 


Phe 


Pro 


340 






Leu 


Lys 


Arg 


355 






He 


Leu 


Ser 


370 






Arg 


Val 


Ser 


385 






Trp 


Lys 


Gly 


400 






Asp 


Gly 


Arg 


415 






Val 


Leu 


Val 


430 






Gly 


Val 


Leu 


445 






Glu 


Asp 


Leu 


460 






Ser 


Thr 


Leu 


475 






Gin 


Asn 


Gin 


490 






Glu 


Pro 


Val 


505 






Ala 


Trp 


Val 


520 






His 


Ser 


His 







60 


Gin 


He 


Gly 
75 


Leu 


Gly 


Ala 

90 


Gly 


Arg 


Leu 
105 


Val 


Ala 


Arg 
120 


Cys 


Ala 


Leu 
135 


Leu 


Cys 


Phe 
150 


Arg 


Gin 


Leu 
165 


Asp 


Ala 


Ala 
180 


Tyr 


Leu 


Glu 
195 


Gin 


Arg 


Phe 
210 


Pro 


Gin 


Trp 
225 


Leu 


Asn 


Ser 
240 


Ala 


Gin 


Arg 
255 


Ala 


Val 


Asp 
270 


Glu 


Arg 


Arg 
285 


Glu 


Trp 


His 
300 


Arg 


Gly 


Asp 
315 


Ala 


Phe 


Pro 
330 


Ala 


Ser 


Val 
345 


Tyr 


Gly 


His 
360 


Ala 


Gly 


Gly 
375 


Gin 


Phe 


His 
390 


Ser 


Gin 


He 
405 


Leu 


Tyr 


His 
420 


Leu 


Gly 


Gly 
435 


Gin 


Leu 


His 
450 


Lys 


Val 


Thr 


Cys 


Cys 


Trp 
480 


Glu 


Tyr 


Leu 
495 


Leu 


Ser 


Asp 
510 


Arg 


He 


Pro 
525 


Ser 


Ala 


Cys 
530 










535 


540 


Thr 


Trp 


Gin Gly 


Gly Ala 


Leu 


He Ala Gly Gly Leu Gly Ala Ser 






545 










550 


555 


Glu 


Glu 


Pro Leu 


Asn 


Ser 


Val 


Leu 


Phe Leu Arg Pro He Ser Cys 








560 














Gly 


Phe 


Leu Trp 


Glu 


Ser 


Val Asp 


He 


Gin 


Pro Pro He Thr Pro 






575 










580 


585 


Arg 


Tyr 


Ser His 


Thr 


Ala 


His 


Val 


Leu 


Asn 


Gly Lys Leu Leu Leu 




590 










595 


600 


Val 


Gly 


Gly He 


Trp 


He 


His 


Ser 


Ser 


Ser 


Phe Pro Gly Val Thr 






605 










610 


615 


Val 


lie 


Asn Leu 


Thr 


Thr 


Gly 


Leu 


Ser 


Ser 


Glu Tyr Gin He Asp 








620 










625 


630 


Thr 


Thr 


Tyr Val 


Pro Trp 


Pro 


Leu 


Met 


Leu 


His Asn His Thr Ser 






635 










640 


645 


He 


Leu 


Leu Pro 


Glu 


Glu 


Gin 


Gin 


Leu 


Leu 


Leu Leu Gly Gly Gly 








650 










655 


660 


Gly 


Asn 


Cys Phe 


Ser 


Phe 


Gly Thr Tyr 


Phe 


Asn Pro His Thr Val 






665 










670 


675 


Thr 


Leu 


Asp Leu 


Ser 


Ser 


Leu 


Ser Ala Gly Gin 






680 










685 





<210> 10 

<211> 519 

<212> PRT 

<213> Homo sapiens 

<220> 

<221> misc-feature 

<223> Incyte Clone 1652936 

<400> 10 

Met Met Ser Lys Asn Asp Gly Glu Ile Arg Phe Gly Asn Pro Ala 
1 5 10 15 

Glu Leu His Gly Thr Lys Val Gln Ile Pro Tyr Leu Thr Thr Glu 
20 25 30 

Lys Asn Ser Phe Lys Arg Met Asp Asp Glu Asp Lys Gln Glu Thr 
35 40 45 

Gln Ser Pro Thr Met Ser Pro Leu Ala Ser Pro Pro Ser Ser Pro 
50 55 60 

Pro His Tyr Gln Arg Val Pro Leu Ser His Gly Tyr Ser Lys Leu 
65 70 75 

Arg Ser Ser Ala Glu Gln Met His Pro Ala Pro Tyr Glu Ala Arg 
80 85 90 

Gln Pro Leu Val Gln Pro Glu Gly Ser Ser Ser Gly Gly Pro Gly 
95 100 105 

Thr Lys Pro Leu Arg His Gln Ala Ser Leu Ile Arg Ser Phe Ser 
110 115 120 

Val Glu Arg Glu Leu Gln Asp Asn Ser Ser Tyr Pro Asp Glu Pro 
125 130 135 

Trp Arg Ile Thr Glu Glu Gln Arg Glu Tyr Tyr Val Asn Gln Phe 
140 145 150 

Arg Ser Leu Gln Pro Asp Pro Ser Ser Phe Ile Ser Gly Ser Val 
155 160 165 

Ala Lys Asn Phe Phe Thr Lys Ser Lys Leu Ser Ile Pro Glu Leu 
170 175 180 

Ser Tyr Ile Trp Glu Leu Ser Asp Ala Asp Cys Asp Gly Ala Leu 
185 190 195 

Thr Leu Pro Glu Phe Cys Ala Ala Phe His Leu Ile Val Ala Arg 
200 205 210 

Lys Asn Gly Tyr Pro Leu Pro Glu Gly Leu Pro Pro Thr Leu Gln 
225 


Pro Glu Tyr Leu 


Gln Ala 


Ala 


Phe 


Pro 


Lys 


Pro Lys 


Trp Asp Cys 


























240 


Gin Leu 


Phe Asp 


Ser Tyr 


Ser 


Glu 


Ser 


Leu 


Pro 


Ala 

AX a 


Asn 


Gin 


Gin 


























255 


Pro Arg Asp Leu 


Asn Arg 


Met 


Glu 


inr 




Val 


Lys 


Asp Met Ala 


























270 


Asp Leu 


Pro 


Val 


Pro Asn 


Gin Asp 


vax 


1 nr 


Ser 


Asp 


Asp Lys 


Gin 






one. 








ZD U 










285 


Ala Leu 


Lys 


Ser 


Tiir i±e 


Asn 


Glu 


Ala 


Leu 


Pro 


Lys 


Asp Val 


Ser 














295 










300 


Glu Asp 


Pro 


Ala 


Thr Pro 


Lys 


Asp 


Ser 


Asn 


Ser 


Leu 


Lys Ala Arg 














310 










315 


Pro Arg 


Ser Arg 


Ser Tyr 


Ser 


Ser 


Thr 


Ser 


He 


Glu 


Glu 


Ala 


Met 






0 0 n 








325 










330 


Lys Arg Gly Glu 


Asp Pro 


Pro 


Thr 


Pro 


Pro 


Pro Arg 


Pro 


Gin 


Lys 








335 








340 










345 


Thr His 


Ser Arg 


Ala Ser 


Ser 


Leu Asp 


Leu 


Asn 


Lys 


Val 


Phe 


Gin 
















355 










360 


Pro Ser 


Val 


Pro 


Ala Thr 


Lys 


Ser Gly Leu 


Leu 


Pro 


Pro 


Pro 


Pro 








^ f c 

355 








370 










375 


Ala Leu 


Pro 


Pro 


Arg Pro 


Cys 


Pro 


Ser 


Gin 


Ser 


Glu 


Gin 


Val 


Ser 
















385 










390 


Glu Ala 


Glu 


Leu 


Leu Pro 


Gin 


Leu 


Ser 


Arg 


Ala 


Pro 


Ser 


Gin 


Ala 
















400 










405 


Ala Glu 


Ser 


Ser 


Pro Ala 


Lys 


Lys 


Asp 


Val 


Leu 


Tyr 


Ser 


Gin 


Pro 








410 








415 










420 


Pro Ser 


Lys 


Pro 


Ile Arg 


Arg 


Lys 


Phe 


Arg 


Pro 


Glu 


Asn 


Gin 


Ala 








425 








430 










435 


Thr Glu 


Asn 


Gin 


Glu Pro 


Ser 


Thr 


Ala 


Ala 


Ser 


Gly 


Pro 


Ala 


Ser 








440 








445 










450 


Ala Ala 


Thr 


Met 


Lys Pro 


His 


Pro 


Thr 


Val 


Gin Lys 


Gin 


Ser 


Ser 








455 








460 










465 


Lys Gin 


Lys 


Lys 


Ala He 


Gin 


Thr 


Ala 


He Arg Lys 


Asn Lys 


Glu 


470 








475 










480 


Ala Asn 


Ala 


Val 


Leu Ala 


Arg Leu Asn 


Ser 


Glu 


Leu 


Gin 


Gin 


Gin 








485 








490 










495 


Leu Lys 


Glu 


Val 


His Gin 


Glu Arg 


He 


Ala 


Leu 


Glu 


Asn 


Gin 


Leu 






500 








505 










510 


Glu Gin Leu Arg 


Pro Val 


Thr 


Val 


Leu 




















515 





















<210> 11 

<211> 334 

<212> PRT 

<213> Homo sapiens 

<220> 

<221> misc-feature 

<223> Incyte Clone 1710702 

<400> 11 

Met Phe Arg Trp Glu Arg Ser Ile Pro Leu Arg Gly Ser Ala Ala

1 5 10 15

Ala Leu Cys Asn Asn Leu Ser Val Leu Gln Leu Pro Ala Arg Asn

20 25 30

Leu Thr Tyr Phe Gly Val Val His Gly Pro Ser Ala Gln Leu Leu

35 40 45

Ser Ala Ala Pro Glu Gly Val Pro Leu Ala Gln Arg Gln Leu His

50 55 60

Ala Lys Glu Gly Ala Gly Val Ser Pro Pro Leu Ile Thr Gln Val

65 70 75

His Trp Cys Val Leu Pro Phe Arg Val Leu Leu Val Leu Thr Ser

80 85 90

His Arg Gly Ile Gln Met Tyr Glu Ser Asn Gly Tyr Thr Met Val

95 100 105

Tyr Trp His Ala Leu Asp Ser Gly Asp Ala Ser Pro Val Gln Ala

110 115 120

Val Phe Ala Arg Gly Ile Ala Ala Ser Gly His Phe Ile Cys Val

125 130 135

Gly Thr Trp Ser Gly Arg Val Leu Val Phe Asp Ile Pro Ala Lys

140 145 150

Gly Pro Asn Ile Val Leu Ser Glu Glu Leu Ala Gly His Gln Met

155 160 165

Pro Ile Thr Asp Ile Ala Thr Glu Pro Ala Gln Gly Gln Asp Cys

170 175 180

Val Ala Asp Met Val Thr Ala Asp Asp Ser Gly Leu Leu Cys Val

185 190 195

Trp Arg Ser Gly Pro Glu Phe Thr Leu Leu Thr Arg Ile Pro Gly

200 205 210

Phe Gly Val Pro Cys Pro Ser Val Gln Leu Trp Gln Gly Ile Ile

215 220 225

Ala Ala Gly Tyr Gly Asn Gly Gln Val His Leu Tyr Glu Ala Thr

230 235 240

Thr Gly Asn Leu His Val Gln Ile Asn Ala His Ala Arg Ala Ile

245 250 255

Cys Ala Leu Asp Leu Ala Ser Glu Val Gly Lys Leu Leu Ser Ala

260 265 270

Gly Glu Asp Thr Phe Val His Ile Trp Lys Leu Ser Arg Asn Pro

275 280 285

Glu Ser Gly Tyr Ile Glu Val Glu His Cys His Gly Glu Cys Val

290 295 300

Ala Asp Thr Gln Leu Cys Gly Ala Arg Phe Cys Asp Ser Ser Gly

305 310 315

Asn Ser Phe Ala Val Thr Gly Tyr Asp Leu Ala Glu Ile Arg Arg

320 325 330

Phe Ser Ser Val



<210> 12 
<211> 569 
<212> PRT 

<213> Homo sapiens 

<220> 

<221> misc-feature 

<223> Incyte Clone 3239149

<400> 12 



Met Asp Pro Ala Glu Ala Val Leu Gln Glu Lys Ala Leu Lys Phe

1 5 10 15

Met Asn Ser Ser Glu Arg Glu Asp Cys Asn Asn Gly Glu Pro Pro

20 25 30

Arg Lys Ile Ile Pro Glu Lys Asn Ser Leu Arg Gln Thr Tyr Asn

35 40 45

Ser Cys Ala Arg Leu Cys Leu Asn Gln Glu Thr Val Cys Leu Ala

50 55 60

Ser Thr Ala Met Lys Thr Glu Asn Cys Val Ala Lys Thr Lys Leu

65 70 75

Ala Asn Gly Thr Ser Ser Met Ile Val Pro Lys Gln Arg Lys Leu

80 85 90

Ser Ala Ser Tyr Glu Lys Glu Lys Glu Leu Cys Val Lys Tyr Phe

95 100 105

Glu Gln Trp Ser Glu Ser Asp Gln Val Glu Phe Val Glu His Leu

110 115 120

Ile Ser Gln Met Cys His Tyr Gln His Gly His Ile Asn Ser Tyr

125 130 135

Leu Lys Pro Met Leu Gln Arg Asp Phe Ile Thr Ala Leu Pro Ala

140 145 150

Arg Gly Leu Asp His Ile Ala Glu Asn Ile Leu Ser Tyr Leu Asp

155 160 165

Ala Lys Ser Leu Cys Ala Ala Glu Leu Val Cys Lys Glu Trp Tyr

170 175 180

Arg Val Thr Ser Asp Gly Met Leu Trp Lys Lys Leu Ile Glu Arg

185 190 195

Met Val Arg Thr Asp Ser Leu Trp Arg Gly Leu Ala Glu Arg Arg

200 205 210

Gly Trp Gly Gln Tyr Leu Phe Lys Asn Lys Pro Pro Asp Gly Asn

215 220 225

Ala Pro Pro Asn Ser Phe Tyr Arg Ala Leu Tyr Pro Lys Ile Ile

230 235 240

Gln Asp Ile Glu Thr Ile Glu Ser Asn Trp Arg Cys Gly Arg His

245 250 255

Ser Leu Gln Arg Ile His Cys Arg Ser Glu Thr Ser Lys Gly Val

260 265 270

Tyr Cys Leu Gln Tyr Asp Asp Gln Lys Ile Val Ser Gly Leu Arg

275 280 285

Asp Asn Thr Ile Lys Ile Trp Asp Lys Asn Thr Leu Glu Cys Lys

290 295 300

Arg Ile Leu Thr Gly His Thr Gly Ser Val Leu Cys Leu Gln Tyr

305 310 315

Asp Glu Arg Val Ile Ile Thr Gly Ser Ser Asp Ser Thr Val Arg

320 325 330

Val Trp Asp Val Asn Thr Gly Glu Met Leu Asn Thr Leu Ile His

335 340 345

His Cys Glu Ala Val Leu His Leu Arg Phe Asn Asn Gly Met Met

350 355 360

Val Thr Cys Ser Lys Asp Arg Ser Ile Ala Val Trp Asp Met Ala

365 370 375

Ser Pro Thr Asp Ile Thr Leu Arg Arg Val Leu Val Gly His Arg

380 385 390

Ala Ala Val Asn Val Val Asp Phe Asp Asp Lys Tyr Ile Val Ser

395 400 405

Ala Ser Gly Asp Arg Thr Ile Lys Val Trp Asn Thr Ser Thr Cys

410 415 420

Glu Phe Val Arg Thr Leu Asn Gly His Lys Arg Gly Ile Ala Cys

425 430 435

Leu Gln Tyr Arg Asp Arg Leu Val Val Ser Gly Ser Ser Asp Asn

440 445 450

Thr Ile Arg Leu Trp Asp Ile Glu Cys Gly Ala Cys Leu Arg Val

455 460 465

Leu Glu Gly His Glu Glu Leu Val Arg Cys Ile Arg Phe Asp Asn

470 475 480

Lys Arg Ile Val Ser Gly Ala Tyr Asp Gly Lys Ile Lys Val Trp

485 490 495

Asp Leu Val Ala Ala Leu Asp Pro Arg Ala Pro Ala Gly Thr Leu

500 505 510

Cys Leu Arg Thr Leu Val Glu His Ser Gly Arg Val Phe Arg Leu

515 520 525

Gln Phe Asp Glu Phe Gln Ile Val Ser Ser Ser His Asp Asp Thr

530 535 540

Ile Leu Ile Trp Asp Phe Leu Asn Asp Pro Ala Ala Gln Ala Glu

545 550 555

Pro Pro Arg Ser Pro Ser Arg Thr Tyr Thr Tyr Ile Ser Arg

560 565
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<210> 13 
<211> 123 
<212> PRT 

<213> Homo sapiens 

<220> 

<221> misc-feature 

<223> Incyte Clone 3315936 

<400> 13 



Met Glu Phe Leu Glu Ile Gly Gly Ser Lys Pro Phe Arg Ser Tyr

1 5 10 15

Trp Glu Met Tyr Leu Ser Lys Gly Leu Leu Leu Ile Phe Val Val

20 25 30

Asp Ser Ala Asp His Ser Arg Leu Pro Glu Ala Lys Lys Tyr Leu

35 40 45

His Gln Leu Ile Ala Ala Asn Pro Val Leu Pro Leu Val Val Phe

50 55 60

Ala Asn Lys Gln Asp Leu Glu Ala Ala Tyr His Ile Thr Asp Ile

65 70 75

His Glu Ala Leu Ala Leu Ser Glu Val Gly Asn Asp Arg Lys Met

80 85 90

Phe Leu Phe Gly Thr Tyr Leu Thr Lys Asn Gly Ser Glu Ile Pro

95 100 105

Ser Thr Met Gln Asp Ala Lys Asp Leu Ile Ala Gln Leu Ala Ala

110 115 120

Asp Val Gln

















<210> 14 

<211> 1957 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc-feature

<223> Incyte Clone 016108 

<400> 14 

atttttgtca ctttctgtgt gaactaaagt gattcaatgt ctcttttgga ttgcttctgt 60
acttcaagaa cacaagttga atcactcaga cctgaaaaac agtctgaaac cagtatccat 120
caatacttgg ttgatgagcc aaccctttcc tggtcacgtc catccactag agccagtgaa 180
gtactatgtt ccaccaacgt ttctcactat gagctccaag tagaaatagg aagaggattt 240
gacaacttga cttctgtcca tcttgcacgg catactccca caggaacact ggtaactata 300
aaaattacaa atctggaaaa ctgcaatgaa gaacgcctga aagctttaca gaaagccgtg 360
attctatccc actttttccg gcatcccaat attacaactt attggacagt tttcactgtt 420
ggcagctggc tttgggttat ttctccattt atggcctatg gttcagcaag tcaactcttg 480
aggacctatt tccctgaagg aatgagtgaa actttaataa gaaacattct ctttggagcc 540
gtgagagggt tgaactatct gcaccaaaat ggctgtattc acaggagtat taaagccagc 600
catatcctca tttctggtga tggcctagtg accctctctg gcctgtccca tctgcatagt 660
ttggttaagc atggacagag gcatagggct gtgtatgatt tcccacagtt cagcacatca 720
gtgcagccgt ggttgagtcc agaactactg agacaggatt tacatgggtt atatgtgaag 780
tcagatattt acagtgttgg gatcacagca tgtgaattag ccagtgggca ggtgcctttc 840
caggacatgc atagaactca gatgctgtta cagaaactga aaggtcctcc ttatagccca 900
ttggatatca gtattttccc tcaatcagaa tccagaatga aaaattccca gtcaggtgta 960
gactctggga ttggagaaag tgtgcttgtc tccagtggaa ctcacacagt aaatagtgac 1020
cgattacaca caccatcctc aaaaactttc tctcctgcct tctttagctt ggtacagctc 1080
tgtttgcaac aagatcctga gaaaaggcca tcagcaagca gtttattgtc ccatgttttc 1140
ttcaaacaga tgaaagaaga aagccaggat tcaatacttt cactgttgcc tcctgcttat 1200
aacaagccat caatatcatt gcctccagtg ttaccttgga ctgagccaga atgtgatttt 1260
cctgatgaaa aagactcata ctgggaattc tagggctgcc aaatcatttt atgtcctata 1320
tacttgacac tttctccttg ctgctttttc ttctgtattt ctaggtacaa ataccagaat 1380
tatacttgaa aatacagttg gtgcactgga gaatctatta tttaaaacca ctctgttcaa 1440
aggggcacca gtttgtagtc cctctgtttc gcacagagta ctatgacaag gaaacatcag 1500
aattactaat ctagctagtg tcatttattc tggaattttt ttctaagctg tgactaactc 1560
tttttatctc tcaatataat ttttgagcca gttaattttt ttcagtattt tgctgtccct 1620
tgggaatggg ccctcagagg acagtgcttc caagtacatc ttctcccaga ttctctggcc 1680
tttttaatga gctattgtta aaccaacagg ctagtttatc ttacatcaga cccttttctg 1740
gtagagggaa aatgtttgtg ctttcccttt ttcttctgtt aatacttatg gtaacaccta 1800
actgagcctc actcacatta aatgattcac ttgaaatata tacagaaatt gtaatttgct 1860
tttttttaaa aaagggggct aaagtaacac tttcctactt atgtaaatta tagatcctaa 1920
attcacgcac cccgtgggag ctcaataaag atttact 1957



<210> 15 

<211> 2545 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc-feature 

<223> Incyte Clone 640521 

<400> 15 

gagccgagct gggggcgcag agcgcgggag gcggcggcgg cgcggaccca gtcacccagg 60
ctgcagtgca gtggtgcgat ctcggctcag tcattgcaac cttcacctcc cggattcaag 120
tgattctcct gcctcagcct cccgagtagc tgggattaca ggtgcccacc accatgccca 180
ggtggctccg ctgccggatg ggagtgcccc agtgtgctgg atgaagctgg cgcatgcacc 240
atgtcatcat gtgtctctag ccagcccagc agcaaccggg ccgcccccca ggatgagctg 300
gggggcaggg gcagcagcag cagcgaaagc cagaagccct gtgaggccct gcggggcctc 360
tcatccttga gcatccacct gggcatggag tccttcattg tggtcaccga gtgtgagccg 420
ggctgtgctg tggacctcgg cttggcgcgg gaccggcccc tggaggccga tggccaagag 480
gtcccccttg actcctccgg gtcccaggcc cggccccacc tctccggtcg caagctgtct 540
ctgcaagagc ggtcccaggg tgggctggca gccggtggca gcctggacat gaacggacgc 600
tgcatctgcc cgtccctgcc ctactcaccc gtcagctccc cgcagtcctc gcctcggctg 660
ccccggcggc cgacagtgga gtctcaccac gtctccatca cgggtatgca ggactgtgtg 720
cagctgaatc agtataccct gaaggatgaa attggaaagg gctcctatgg tgtcgtcaag 780
ttggcctaca atgaaaatga caatacctac tatgcaatga aggtgctgtc caaaaagaag 840
ctgatccggc aggccggctt tccacgtcgc cctccacccc gaggcacccg gccagctcct 900
ggaggctgca tccagcccag gggccccatt gagcaggtgt accaggaaat tgccatcctc 960
aagaagctgg accaccccaa tgtggtgaag ctggtggagg tcctggatga ccccaatgag 1020
gaccatctgt acatggtgtt cgaactggtc aaccaagggc ccgtgatgga agtgcccacc 1080
ctcaaaccac tctctgaaga ccaggcccgt ttctacttcc aggatctgat caaaggcatc 1140
gagtacttac actaccagaa gatcatccac cgtgacatca aaccttccaa cctcctggtc 1200
ggagaagatg ggcacatcaa gatcgctgac tttggtgtga gcaatgaatt caagggcagt 1260
gacgcgctcc tctccaacac cgtgggcacg cccgccttca tggcacccga gtcgctctct 1320
gagacccgca agatcttctc tgggaaggcc ttggatgttt gggctatggg tgtgacacta 1380
tactgctttg tctttggcca gtgcccattc atggacgagc ggatcatgtg tttacacagt 1440
aagatcaaga gtcaggccct ggaatttcca gaccagcccg acatagctga ggacttgaag 1500
gacctgatca cccgtatgct ggacaagaac cccgagtcga ggatcgtggt gccggaaatc 1560
aagctgcacc cctgggtcac gaggcatggg gcggagccgt tgccgtcgga ggatgagaac 1620
tgcacgctgg tcgaagtgac tgaagaggag gtcgagaact cagtcaaaca cattcccagc 1680
ttggcaaccg tgatcctggt gaagaccatg atacgtaaac gctcctttgg gaacccattc 1740
gagggcagcc ggcgggagga acgctcactg tcagcgcctg gaaacttgct cacgaagcaa 1800
ggcagcgaag acaacctcca gggcaccgac ccgccccccg tgggggagga ggaagtgctc 1860
ttgtgagagg cagtccctgc gtggaaagtt gctgggcccc cgcccccggc tcccccgcac 1920
gcatgcatcc actgcggccg gaggaggcca tggagcccga gtagctgcct ggatcgctcg 1980
acctcgcatg cgcgccgcgt cgcctctggg gggctgctgc accgcgtttc catagcagca 2040
tgtcctacgg aaacccagca cgtgtgtaga gcctcgatcg tcatctctgg ttatttgttt 2100
tttcctttgt tgttttaaag gggacaaaaa aaaaaaaaaa aaggacttga ctccatgacg 2160
tcgaccgtgg ccgctggctg gctggacagg cgggtgtgag gagttgcaga cccaaaccca 2220
cgtgcatttt gggacaattg ctttttaaaa cgtttttatg ccaaaaatcc ttcattgtga 2280
ttttcagaac cacgtcagat ataccaagtg actgtgtgtg gggtttgaca actgtggaaa 2340
ggcgagcaga aaactccggc ggtctgaggc catggaggtg gttgctgcat ttgagaggga 2400
gtagggggct agatgtggct cctagtgcaa accggaaacc atggcacctt ccagagccgt 2460
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ggtctcaagg agtcagagca gggctggccc tcagtagctg cagggagctt tgatggcaac 2520 
ttattttgtt aagaagggtt ttttt 2545 



<210> 16 

<211> 3034 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc-feature 

<223> Incyte Clone 1250171 



<400> 16 

tcctgagtct cgaggaggcc gcgggagccc gccggcggtg gcgcggcgga gacccggctg 60
gtataacaag aggattgcct gatccagcca agatgcagag cacttctaat catctgtggc 120
ttttatctga tattttaggc caaggagcta ctgcaaacgt ctttcgtgga agacataaga 180
aaactggtga tttatttgct atcaaagtat ttaataacat aagcttcctt cgtccagtgg 240
atgttcaaat gagagaattt gaagtgttga aaaaactcaa tcacaaaaat attgtcaaat 300
tatttgctat tgaagaggag acaacaacaa gacataaagt acttattatg gaattttgtc 360
catgtgggag tttatacact gttttagaag aaccttctaa tgcctatgga ctaccagaat 420
ctgaattctt aattgttttg cgagatgtgg tgggtggaat gaatcatcta cgagagaatg 480
gtatagtgca ccgtgatatc aagccaggaa atatcatgcg tgttataggg gaagatggac 540
agtctgtgta caaactcaca gattttggtg cagctagaga attagaagat gatgagcagt 600
ttgtttctct gtatggcaca gaagaatatt tgcaccctga tatgtatgag agagcagtgc 660
taagaaaaga tcatcagaag aaatatggag caacagttga tctttggagc attggggtaa 720
cattttacca tgcagctact ggatcactgc catttagacc ctttgaaggg cctcgtagga 780
ataaagaagt gatgtataaa ataattacag gaaagccttc tggtgcaata tctggagtac 840
agaaagcaga aaatggacca attgactgga gtggagacat gcctgtttct tgcagtcttt 900
ctcggggtct tcaggttcta cttacccctg ttcttgcaaa catccttgaa gcagatcagg 960
aaaagtgttg gggttttgac cagttttttg cagaaactag tgatatactt caccgaatgg 1020
taattcatgt tttttcgcta caacaaatga cagctcataa gatttatata catagctata 1080
atactgctac tatatttcat gaactggtat ataaacaaac caaaattatt tcttcaaatc 1140
aagaacttat ctacgaaggg cgacgcttag tcttagaacc tggaaggctg gcacaacatt 1200
tccctaaaac tactgaggaa aaccctatat ttgtagtaag ccgggaacct ctgaatacca 1260
taggattaat atatgaaaaa atttccctcc ctaaagtaca tccacgttat gatttagacg 1320
gggatgctag catggctaag gcaataacag gggttgtgtg ttatgcctgc agaattgcca 1380
gtaccttact gctttatcag gaattaatgc gaaaggggat acgatggctg attgaattaa 1440
ttaaagatga ttacaatgaa actgttcaca aaaagacaga agttgtgatc acattggatt 1500
tctgtatcag aaacattgaa aaaactgtga aagtatatga aaagttgatg aagatcaacc 1560
tggaagcggc agagttaggt gaaatttcag acatacacac caaattgttg agactttcca 1620
gttctcaggg aacaatagaa accagtcttc aggatatcga cagcagatta tctccaggtg 1680
gatcactgqc agacgcatgg gcacatcaag aaggcactca tccgaaagac agaaatgtag 1740
aaaaactaca agtcctgtta aattgcatga cagagattta ctatcagttc aaaaaagaca 1800
aagcagaacg tagattagct tataatgaag aacaaatcca caaatttgat aagcaaaaac 1860
tgtattacca tgccacaaaa gctatgacgc actttacaga tgaatgtgtt aaaaagtatg 1920
aggcattttt gaataagtca gaagaatgga taagaaagat gcttcatctt aggaaacagt 1980
tattatcgct gactaatcag tgttttgata ttgaagaaga agtatcaaaa tatcaagaat 2040
atactaatga gttacaagaa actctgcctc agaaaatgtt tacagcttcc agtggaatca 2100
aacataccat gaccccaatt tatccaagtt ctaacacatt agtagaaatg actcttggta 2160
tgaagaaatt aaaggaagag atggaagggg tggttaaaga acttgctgaa aataaccaca 2220
ttttagaaag gtttggctct ttaaccatgg atggtggcct tcgcaacgtt gactgtcttt 2280
agctttctaa tagaagttta agaaaagttt ccgtttgcac aagaaaataa cgcttgggca 2340
ttaaatgaat gcctttatag atagtcactt gtttctacaa ttcagtattt gatgtggtcg 2400
tgtaaatatg tacaatattg taaatacata aaaaatatac aaatttttgg ctgctgtgaa 2460
gatgtaattt tatcttttaa catttataat tatatgagga aatttgacct cagtgatcac 2520
gagaagaaag ccatgaccga ccaatatgtt gacatactga tcctctactc tgagtggggc 2580
taaataagtt attttctctg accgcctact ggaaatattt ttaagtggaa ccaaaatagg 2640
catccttaca aatcaggaag actgacttga cacgtttgta aatggtagaa cggtggctac 2700
tgtgagtggg gagcagaacc gcaccactgt tatactggga taacaatttt tttgagaagg 2760
ataaagtggc attattttat tttacaaggt gcccagatcc cagttatcct tgtatccatg 2820
taatttcaga tgaattatta agcaaacatt ttaaagtgaa ttcattatta aaaactattc 2880
atttttttcc tttggccata aatgtgtaat tgtcattaaa attctaaggt catttcaact 2940
gttttaagct gtatatttct ttaattctgc ttactatttc atggaaaaaa ataaatttct 3000
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caattttaaa aaattttttt ataaaaaaaa aiaaa 3034 



<210> 17 

<211> 1337 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc-feature 

<223> Incyte Clone 1911587 

<400> 17 

gaaagctgtg ggaccatcct ggcaaccccg gtgtttggct gggttctagc gtaccgtctg 60 

tgtggccggt gggggacctg cggtcggagt gggagggcca gtctgcaccc aagaggtgga 120 

agaggacggg ctttaggctg gaagcgcctt agaggagcca tttttccagg tggggcccca 180

ggcagaggct ccgacaggga gcctggccat agtcgcgcac caggggaggt ggagcgcgtc 240 

ccagacccga gcccccgacc tcagccaaac ccattccttc tgtccttgga ggccagaggg 300 

gactctgagc atcggaaagc aggatgcctg gtttgctttt atgtgaacca acagagcttt 360

acaacatcct gaatcaggcc acaaaactct ccagattaac agaccccaac tatctctgtt 420 

tattggatgt ccgttccaaa tgggagtatg acgaaagcca tgtgatcact gcccttcgag 480 

tgaagaagaa aaataatgaa tatcttctcc cggagtctgt ggacctggag tgtgtgaagt 540 

actgcgtggt gtatgataac aacagcagca ccctggagat actcttaaaa gatgatgatg 600 

atgattcaga ctctgatggt gatggcaaag atcttgtgcc tcaagcagcc attgagtatg 660 

gcaggatcct gacccgcctc acccaccacc ccgtctacat cctgaaaggg ggctatgagc 720 

gcttctcagg cacgtaccac tttctccgga cccagaagat catctggatg cctcaggaac 780 

tggatgcatt tcagccatac cccattgaaa tcgtgccagg gaaggtcttc gttggcaatt 840 

tcagtcaagc ctgtgacccc aagattcaga aggacttgaa aatcaaagcc catgtcaatg 900 

tctccatgga tacagggccc ttttttgcag gcgatgctga caggcttctg cacatccgga 960 

tagaagattc cccggaagcc cagattcttc ccttcttacg ccacatgtgt cacttcattg 1020 

aaattcacca tcaccttggc tctgtcattc tgatcttttc cacccagggt atcagccgca 1080 

gttgtgccgc catcatagcc tacctcatgc atagtaacga gcagaccttg cagaggtcct 1140 

gggcctatgt caagaagtgc aaaaacaaca tgtgtccaaa tcggggattg gtgagccagc 1200 

tgctggaatg ggagaagact atccttggag attccatcac aaacatcatg gatccgctct 1260 

actgatcttc tccgaggccc accgaagggt actgaagagc ctcacctggg ggcattttgt 1320 

gggtggaggg ccagagt 1337 



<210> 18 

<211> 1639 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc-feature 

<223> Incyte Clone 2079081 

<400> 18 

gacaaaagcc agacacattt caacatgagg gacccactga cagattgtcc gtataataaa 60 
gtatacaaga acctaaagga gttttctcaa aatggagaga atttctgcaa acaggtcaca 120 
tctgttcttc agcaaagggc aaacctggaa attagctatg ccaaaggact tcagaaactg 180 
gcaagcaagc tgagcaaagc attacagaac acgagaaaaa gttgtgttag cagtgcctgg 240 
gcctgggcct cagagggaat gaaatccaca gcggacctgc atcaaaaact tggcaaagca 300 
attgaattgg aagcaataaa accgacttat caagtcctaa atgtacaaga gaagaagaga 360 
aaatcacttg acaatgaagt tgaaaagaca gcaaatcttg tcattagcaa ctggaatcag 420 
caaattaagg ccaagaagaa attaatggtt agtaccaaga aacatgaagc acttttccag 480 
cttgtagaaa gctccaagca atctatgact gagaaggaga agcggaagct cctcaataaa 540 
ctgacaaaat caactgaaaa gttggaaaag gaagatgaaa attactacca aaaaaacatg 600 
gcgggttatt ctacqagact gaaatgggaa aacacactag agaactgcta ccagagcatt 660 
ctggagctgg agaaggaaag aattcaactt ttatgcaata acttaaacca gtacagccaa 720 
catatttctc tttttggcca aaccctgacc acatgccaca cgcagattca ctgtgccatc 780 
agcaagattg acattgaaaa agatatccag gctgtaatgg aagaaactgc aattttatct 840 
acagaaaaca aatctgagtt cctgttaacg gattactttg aagaagatcc taacagtgca 900 
atggataaag agagacgaaa gtctttacta aaaccaaaat tattgagact gcagagagac 960 
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attgaaaaag cctcaaaaga caaggaaggc ctggaacgaa tgcttaaaac gtactccagc 1020 

acctcctcct tctctgatgc aaagagccag aaagacacag cagcgttaat ggatgagaac 1080 

aatttgaaac tagacctttt ggaagcgaac tcctacaaac tgtcatcaat gttagcagaa 1140 

cttgagcaaa gacctcaacc cagccatcct tgtagtaatt ccatcttcag gtggagggaa 1200 

aaggagcata ctcatagcta tgtgaaaata tctcggcctt ttttaatgaa gagattagag 1260 

aatattgtga gcaaggcatc ttctggtggg cagagcaatc caggttcttc aactccagcc 1320 

cctggtgcag cccagctcag cagcagactt tgcaaggcct tgtattcttt tcaagccagg 1380 

caagatgatg agttgaattt ggaaaagggt gacattgtga ttatacacga gaaaaaagaa 1440 

gaaggatggt ggtttggatc tttgaatggg aaaaaaggcc attttcctgc cgcttatgtg 1500 

gaggagttac cttcaaatgc tggcaacaca gctacaaagg cataaaacaa gactctgaac 1560 

atactacctt cacactcggt aatcaacaat acagtgtggt tcaaataaga ataaagtgct 1620 

cttaccttta aaaaaaaaa 1639 



<210> 19 

<211> 1504 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc-feature 

<223> Incyte Clone 2472655 

<400> 19 

cgaaatcgta ggacttccga aagcagcggt ggcgtttgct tcactgcttg gaagtgtgag 60
tgcgcgaaga tgcgaaaggt ggttttgatc accggggcta gcagtggcat tggcctggcc 120
ctctgcaagc ggctgctggc ggaagatgat gagcttcatc tgtgtttggc gtgcaggaac 180
atgagcaagg cagaagctgt ctgtgctgct ctgctggcct ctcaccccac tgctgaggtc 240
accattgtcc aggtggatgt cagcaacctg cagtcggtct tccgggcctc caaggaactt 300
aagcaaaggt ttcagagatt agactgtata tatctaaatg ctgggatcat gcctaatcca 360
caactaaata tcaaagcact tttctttggc ctcttttcaa gaaaagtgat tcatatgttc 420
tccacagctg aaggcctgct gacccagggt gataagatca ctgctgatgg acttcaggag 480
gtgtttgaga ccaatgtctt tggccatttt atcctgattc gggaactgga gcctctcctc 540
tgtcacagtg acaatccatc tcagctcatc tggacatcat ctcgcagtgc aaggaaatct 600
aatttcagcc tcgaggactt ccagcacagc aaaggcaagg aaccctacag ctcttccaaa 660
tatgccactg accttttgag tgtggctttg aacaggaact tcaaccagca gggtctctat 720
tccaatgtgg cctgtccagg tacagcattg accaatttga catatggaat tctgcctccg 780
tttatatgga cgctgttgat gccggcaata ttgctacttc gcttttttgc aaatgcattc 840
actttgacac catataatgg aacagaagct ctggtatggc ttttccacca aaagcctgaa 900
tctctcaatc ctctgatcaa atatctgagt gccaccactg gctttggaag aaattacatt 960
atgacccaga agatggacct agatgaagac actgctgaaa aattttatca aaagttactg 1020
gaactggaaa agcacattag ggtcactatt caaaaaacag ataatcaggc caggctcagt 1080
ggctcatgcc tataattcca gcactttggg aggccaaggc agaaggatca cttgagacca 1140
ggagttcaag accagcctga gaaacatagt gagcccttgt ctctacaaaa agaaaraaaa 1200
ataatagctg ggtgtggtgg catgcgcatg tagtcccagc tactcagaag gatgaggtgg 1260
gaggatctct tgaggctggg aggcagaggt tgcagtgagc tgagattgtg ccactgcact 1320
ccagcctggg tgacagcgag accctgtctc aaaatatgta tatatttaat atatatataa 1380
aaccagagct gacaatgaca ctctggaaca ttgcatacct tctgtacatt ctggggtaca 1440
tggatttcta ctgagttgga taatatgcat ttgtaataaa ctatgaacta tgaaaaaaaa 1500
aaaa 1504



<210> 20 

<211> 3096 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc-feature 

<223> Incyte Clone 2948818 

<400> 20 

gggtgttctt ataacttaag ttcagttttt tctttcctgt gaggaagagg cagtttttta 60 
aatgaagcca tctctgggga aatcgtattg attgttgtag ctaaatacgg aatttttaaa 120 
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gtctttagta tgttgaactg gaaatatagg acatgcgttt gagatctact gtgagttgca 180 
tcataaatac aaaggactga agttataaaa gagaaaagag aagtttgctg ctaaaatgaa 240 
tctgagcaat atggaatatt ttgtgccaca cacaaaaagg tactgaggat ttacccccca 300 
aaaaaaattg tcaatgagaa ataaagctaa ctgatatcaa aaagcagagc ctgctctact 360 
ggccatcatg cgtaaagggg tgctgaagga cccagagatt gccgatctat cctacaaaga 420 
tgatcctgag gaacttttta ttggtttgca tgaaattggg catggaagtt ttggagcagt 480 
ttattttgct acaaatgctc acaccagtga ggtggtggca attaagaaga tgtcctatag 540 
tgggaagcag acccatgaga aatggcaaga tattcttaag gaagttaaat ttttacgaca 600 
attgaagcat cctaatacta ttgagtacaa aggctgttac ttgaaagaac acactgcttg 660 
gttggtgatg gaatattgct taggctcagc ctctgattta ttagaagttc ataaaaaacc 720 
acttcaggaa gtggagatcg ctgccattac tcatggagcc ttgcatggac tagcctacct 780 
acattctcat gcattgattc atagggatat taaagcagga aatattcttc taacagagcc 840 
aggtcaggta aaactagctg attttggatc tgcttcaatg gcttctcctg ccaactcctt 900 
cgtgggcaca ccttactgga tggctccaga ggtgatctta gctatggatg aaggacagta 960 
tgatgggaaa gttgatattt ggtcacttgg catcacttgt attgaattgg cggaacggaa 1020 
gccgcccctt ttcaacatga atgcaatgag tgccttatat cacattgccc agaatgactc 1080 
cccaacgttia cagtctaatg aatggacaga ctcctttagg agatttgttg attactgctt 1140 
gcagaaaata cctcaggaaa ggccaacatc agcagaacta ttaaggcatg actttgttcg 1200 
acgagaccgg ccactacgtg tcctcattga cctcatacag aggacaaaag atgcagttcg 1260 
tgagctagat aacctacagt accgaaaaat gaaaaaaata cttttccaag agacacggaa 1320 
tggacccttg aatgagtcac aggaggatga ggaagacagt gaacatggaa ccagcctgaa 1380 
cagggaaatg gacagcctgg gcagcaacca ttccattcca agcatgtccg tgagcacagg 1440 
cagccagagc agcagtgtga acagcatgca ggaagtcatg gacgagagca gttccgaact 1500 
tgtcatgatg cacgatgacg aaagcacaat caattccagc tcctccgtcg tgcataagaa 1560 
agatcatgta ttcataaggg atgaggcggg ccacggcgat cccaggcctg agccgcggcc 1620 
tacccagtca gttcagagcc aggccctcca ctaccggaac agagagcgct ttgccacgat 1680 
caaatcagca tctttggtta cacgacagat ccatgagcat gagcaggaga acgagttgcg 1740 
ggaacagatg tcaggttata agcggatgcg gcgccagcac cagaagcagc tgatcgccct 1800 
ggagaacaag ctgaaggctg agatggacga gcaccgcctc aagctacaga aggaggtgga 1860 
gacgcatgcc aacaactcgt ccatcgagct ggagaagctg gccaagaagc aagtggctat 1920 
catagaaaag gaggcaaagg tagctgcagc agatgagaag aagttccagc aacagatctt 1980 
ggcccagcag aagaaagatt tgacaacttt cttagaaagt cagaagaagc agtataagat 2040
ttgtaaggaa aaaataaaag aggaaatgaa tgaggaccat agcacaccca agaaagagaa 2100 
gcaagagcgg atctccaaac ataaagagaa cttgcagcac acacaggctg aagaggaagc 2160 
ccaccttctc actcaacaga gactgtacta cgacaaaaat tgtcgtttct tcaagcggaa 2220 
aataatgatc aagcggcacg aggtggagca gcagaacatt cgggaggaac taaataaaaa 2280 
gaggacccag aaggagatgg agcatgccat gctaatccgg cacgacgagt ccacccgaga 2340 
gctagagtac aggcagctgc acacgttaca gaagctacgc atggatctga tccgtttaca 2400 
gcaccagacg gaactggaaa accagctgga gtacaataag aggcgagaaa gagaactgca 2460 
cagaaagcat gtcatggaac ttcggcaaca gccaaaaaac ttaaaggcca tggaaatgca 2520 
aattaaaaaa cagtttcagg acacttgcaa agtacagacc aaacagtata aagcactcaa 2580 
gaatcaccag ttggaagtta ctccaaagaa tgagcacaaa acaatcttaa agacactgaa 2640 
agatgagcag acaagaaaac ttgccatttt ggcagagcag taTigaacaga gtataaatga 2700 
aatgatggcc tctcaagcgt tacggctaga tgaggctcaa gaagcagaat gccaggcctt 2760 
gaggctacag ctccagcagg aaatggagct gctcaacgcc taccagagca aaatcaagat 2820 
gcaaacagag gcacaacatg aacgtgagct ccagaagcta gagcagagag tgtctctgcg 2880 
cagagcacac cttgagcaga agattgaaga ggagctggct gcccttcaga agga^cgcag 2940 
cgagagaata aagaacctat tggaaaggca agagcgagag attgaaactt ttgacatgga 3000 
gagcctcaga atgggatttg ggaatttggt tacattagat tttcctaagg aggactacag 3060 
atgagattaa attttttgcc atttacaaaa aaaaaa 3096 



<210> 21 

<211> 1527 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc-feature

<223> Incyte Clone 054191 

<400> 21 

ctccgcttga ggagaagcgc caagtgcgca tggggacgct atagcaattc gtttgctgtc 60 
cttcctctcc ttcgaagatg acaaggccta ccatcgtttc ttcctgcctt tgggccgtca 120
ggcagttggt tgggacccgc tccaaccctc ggttcttcct gcaatacagt ggatacaatt 180
tgtcatggct actctgagtg ttataggttc aagttcactt attgcctatg ctgtattcca 240
taatatacag aaatctccag agataagacc acttttttat ctgagcttct gtgacctgct 300
cctgggactt tgctggctca cggagacact tctctatgga gcttcagtag caaataagga 360
catcatctgc tataacctac aagcagttgg acagatattc tacatttcct catttctcta 420
caccgtcaat tacatctggt atttgtacac agagctgagg atgaaacaca cccagagtgg 480
acagagcaca tctccactgg tgatagatta tacttgtcga gttggtcaaa tggcctttgt 540
tttctcaagc ctgatacctc tgctattgat gacacctgta ttctgtctgg gaaatactag 600
tgaatgtttc caaaacttca gtcagagcca caagtgtatc ttgatgcact caccaccatc 660
agccatggct gaacttccac cttctgccaa cacatctgtc tgtagcacac tttattttta 720
tggtatcgcc attttcctgg gcagctttgt actcagcctc cttaccatta tggtcttact 780
tatccgagcc cagacattgt ataagaagtt tgtgaagtca actggctttc tggggagtga 840
acagtgggca gtgattcaca ttgtggacca acgggtgcgc ttctacccag tggccttctt 900
ttgctgctgg ggcccagctg tcattctaat gatcataaag ctgactaagc cacaggacac 960
caagcttcac atggcccttt atgttctcca ggctctaacg gcaacatctc agggtctact 1020
caactgtgga gtatatggct ggacgcagca caaattccac caactaaagc aggaggctcg 1080
gcgtgatgca gatacccaga caccattatt atgctcacag aagagattct atagcagggg 1140
cttaaattca ctggaatcca ccctgacttt tcctgccagt acttctacca ttttttgaaa 1200
ctacaatact ggaacatcca ggaactggag ttattctacg ctaatggatt ggaaagaatg 1260
ttgggaaagg acatcttaaa tcttttctaa ctatgcccta aactgcagaa ctcaaaggaa 1320
atatagtgcc attgttagta gtcattctag atgaattggg agtatctctc cagttattcc 1380
cagattcact agtgatcctt aaagtctcta ttcagggaga ggaagacact ttccatctca 1440
gagatagact cgtgttacct tgatggatat tggatttgtc taagtctctt ctagaaaaaa 1500
taaattctag atrartaaaa aaaaaaa 1527



<210> 22 

<211> 2948 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc-feature 

<223> Incyte Clone 1403604 

<400> 22 

aaagaaaggt cagccgcaag cgaacttagc actggctaca ccctcctcaa ttctggttgg 60
cgagatgcgc tcttcccgga agtgacgcac aagtgccggc ggaaggggaa gtccaggags 120
atgggtggtt tttttccccc taccgaggtc cgtgaggtgt gtgctaacca aggggcggct 180
cacaaccgtg acagactgcc attcctgagt ctcttctggc catgggcccc cggagccgtg 240
agcgtcgggc aggcgcggta cagaacacca acgacagcag cgccctcagc aagcgttccc 300
tggccgcgcg cgggtacgtg caggacccct ttgccgcgtt gctggttccg ggcgcggcgc 360
gccgcgcacc gctcattcac cgaggctact acgtccgcgc acgcgccgtg aggcactgcg 420
tgcgcgcttt tttggagcag attggcgcgc cccaggccgc gcttcgcgcg cagatcttgt 480
ctctcggcgc tggcttcgac tcgctctatt ttcgcttaaa aaccgcgggc cgcctggccc 540
gggctgcagt ctgggaggtg gattttccgg acgtggcgcg gcgcaaagca gaaaggattg 600
gagagacgcc agagctgtgc gcgttaaccg ggcctttcga gaggggggag cccgcgtccg 660
cgctgtgctt tgagagcgca gactactgca tcctgggtct ggacttgcgg cagctccagc 720
gagtggagga ggccctgggc gccgcggggc tcgacgcagc ctcacccact ctgctcctgg 780
ccgaggcggt gctgacctac ctcgagccgg agagtgccgc ggccctcatc gcctgggcag 840
cccagcgttt tcctaatgcc cttttcgtgg tctatgagca gatgaggcct caagacgcct 900
ttggccagtt catgctgcaa cattttcggc agctaaactc ccccctgcat ggcctggagc 960
gttttcctga cgtggaggcg cagcggcgcc gcttccttca agctggctgg accgcctgcg 1020
gtgccgtgga cataaatgaa ttctatcact gctttcttcc cgcagaagaa cgccggcggg 1080
tggaaaatat tgaacccttt gacgaatttg aggagtggca tctgaagtgc gcccattatt 1140
tcattctggc agcttctagg ggagacaccc tctcccacac cctagtgttt ccatcctcag 1200
aggcatttcc tcgcgtaaat cctgcttcgc cttcaggggt attccctgcc agcgtagtca 1260
gtagcgaggg ccaggrccca aacctgaaga gatatggcca cgcctctgtc ttcttgagcc 1320
cagacgttat tctcagtgca ggaggatttg gagagcagga ggggcggcac tgccgagtga 1380
gccagtttca cttgctctca agagattgtg actctgaatg gaaaggcagc caaataggca 1440
gttgtgggac tggagttcag tgggatggac gcctttatca caccatgaca agactctcag 1500
agagtcgggt tctggttctg ggagggagac tgtccccagt aagtccagcc ttgggggttc 1560
tccagcttca tttttttaag agtgaggata ataacactga ggacctgaaa gtgacaataa 1620
caaaggctgg ccgaaaggat gattccactt tgtgttgttg gcggcattca acaacagaag 1680
tgtcctgtca gaatcaggaa tatttgtttg tgtatggggg tcgaagcgtg gtggaacctg 1740
tactaagtga ctggcatttc ctccatgtag ggacaatggc ttgggtcagg atcccagtgg 1800
agggagaagt acctgaagcc cggcattctc acagtgcctg cacttggcaa gggggagccc 1860
ttattgctgg aggtctcggg gcttctgagg agccattgaa ctctgtgctc tttctgagac 1920
caatctcttg tggattcctc tgggagtcag tagacatcca gcctcccatt accccaaggt 1980
actcccacac agctcatgtg ctcaatggaa agctgttact ggttggaggg atctggattc 2040
attcctcctc atttcctgga gtgactgtga tcaatttgac tacaggattg agctctgagt 2100
atcagattga cacaacatat gtgccatggc cattaatgtt acacaaccat actagtatcc 2160
ttcttcctga agagcaacag ctcctgctcc ttggaggtgg tgggaactgc ttttcctttg 2220
gtacctactt caacccccat acagtcacat tagacctttc ttccttaagt gctgggcagt 2280
aaggactgga ctaatattca ggacccacta aagtagacaa taaagttttc cacaaatagg 2340
atgaccctct agctatagat actgccactc ctcctttccc catccttttt ttcccttagc 2400
actattcagt gcaaaaagtg aaaaaggttg gtaaaatagg taaaatacct agaaacaatc 2460
actacagaaa acagctgaag acagtggcca tgcagtccga gaggagtagt ggtctgcctc 2520
taattttcta atctaagttc gtttattgag ttacagtggt ctttagtaaa gtaaaacaat 2580
ttcccaatcc caggccttgt gatttgagat ggtaccttag aaaaagttac acgcagttcc 2640
gtggttgaat atatttgaga tggtacctta gaaaaagttt cacgcagatc cttggttgaa 2700
tatagttgag ggagcgtagt attgacaatt cttcatgtag gaaacctgaa atgaacacag 2760
tcacagtttg attaaaacat tgtcctgttt gttgcaacag aaaactcgga tagttttaac 2820
aacaggaaac acttgtagga cttcctttac caacatactt tttaaatgtt ttgctattgg 2880
ttccatattt atttagattt ataagtgtca ataaagcaaa cttttgatgc ctcaaaaaaa 2940
aaaaaaaa 2948



<210> 23 

<211> 1808 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc-feature 

<223> Incyte Clone 1652936 

<400> 23 

gagagtatta aatgtgaatt gcctctgcct cgctttatga tgtcaaagaa tgatggtgag 60 
atacgatttg ggaacccagc tgagctgcat ggaactaagg ttcagattcc atatttaact 120 
acagaaaaaa attccttcaa aagaatggac gatgaggata aacaggaaac acagtctccc 180 
acgatgtcac ccctcgcctc ccctccttct tccccgcctc attaccagag ggtgcccttg 240 
agccatggct acagcaaact gcggagcagc gcagaacaga tgcatccagc accttatgaa 300 
gctaggcagc cccttgtcca gcccgaggga tcctcatcag ggggcccagg aaccaagccc 360 
cttcggcatc aggcttccct tatccggtcc ttttcagtgg agagggaact acaggataac 420 
agcagttacc ccgacgaacc ctggaggata acagaagaac agcgcgagta ctatgtcaat 480 
cagttccgat cccttcagcc agacccaagc tctttcattt caggttctgt ggccaagaac 540 
ttcttcacca aatcaaagct ttccattcca gaactctcct atatatggga gcttagtgat 600 
gctgactgtg atggagccct gaccctgcct gagttctgtg ctgcgtttca tctcattgtg 660 
gctcggaaga acggctaccc attgcctgag ggcctccctc caactctgca gccagaatac 720 
ctgcaggcag cttttcctaa gcccaaatgg gactgtcaat tatttgattc ttattctgag 780 
tcactgccgg caaatcaaca acctcgtgac ttgaatcgga tggagacatc tgttaaagac 840 
atggctgacc ttcctgtccc taaccaggat gtaactagtg atgacaaaca agctttgaaa 900 
agtactatca atgaagcctt accaaaggac gtgtctgagg atccagcaac tcccaaggat 960 
tccaacagtc tcaaagcaag accaagatcc agatcttact ctagcacctc catagaagag 1020 
gccatgaaaa ggggcgagga ccctcccacc ccgccacctc ggccacagaa aacccattcc 1080 
agagcctcct ccttggatct gaataaagtc ttccagccca gtgtgccagc taccaagtca 1140 
ggattgttac ccccaccacc tgcgctccct ccaagacctt gtccatcaca gtctgaacaa 1200 
gtgtcggagg ccgagttact cccacagctg agcagagccc catcccaggc tgcagaaagt 1260 
agtccagcaa agaaggatgt actgtattct cagccaccat caaagcccat tcgtaggaaa 1320 
ttcagaccag aaaaccaagc tacagaaaac caagagcctt ccactgctgc aagtgggcca 1380 
gcttctgcgg caaccatgaa accgcatcca acagtccaaa agcagtcttc caaacagaag 1440 
aaggccattc aaactgctat ccgcaaaaat aaagaggcaa acgcagtgct ggctcggctg 1500 
aacagtgagc tccagcagca gctcaaggag gttcatcaag aacgaattgc attggaaaac 1560 
caattggaac aacttcgtcc ggtcactgtg ttgtgacccc cccatggttc aagtgacagt 1620 
gggtgacctt gtctgccaag atctttcttt tgaatgtttt gaacccaact acttgtcata 1680 
gatgtttgac tgtgtcaaaa gctgtgagca gcaaaatata atccatatga ccttttctct 1740 
tgtatagact taaaaaaaaa aaaatagatc tttaattaag cggtcgcaag cttattccct 1800 






